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Preface 



We describe in this book, hybrid intelligent systems using soft computing tech- 
niques for intelligent control and mobile robotics. Soft Computing (SC) consists of 
several intelligent computing paradigms, including fuzzy logic, neural networks, 
and bio-inspired optimization algorithms, which can be used to produce powerful 
hybrid intelligent systems. The book is organized in five main parts, which contain 
a group of papers around a similar subject. The first part consists of papers with 
the main theme of theory and algorithms, which are basically papers that propose 
new models and concepts, which can be the basis for achieving intelligent control 
and mobile robotics. The second part contains papers with the main theme of intel- 
ligent control, which are basically papers using bio-inspired techniques, like 
evolutionary algorithms and neural networks, for achieving intelligent control of 
non-linear plants. The third part contains papers with the theme of optimization of 
fuzzy controllers, which basically consider the application of bio-inspired optimi- 
zation methods to automate the design process of optimal type-1 and type-2 fuzzy 
controllers. The fourth part contains papers that deal with the application of SC 
techniques in times series prediction and intelligent agents. The fifth part contains 
papers with the theme of computer vision and robotics, which are papers consider- 
ing soft computing methods for applications related to vision and robotics. 

In the part of theory and algorithms there are 5 papers that describe different 
contributions that propose new models and concepts, which can be the considered 
as the basis for achieving intelligent control and mobile robotics. The first paper, 
by Ramon Zatarain et al., deals with applying intelligent systems for modeling 
students' learning styles used for mobile and web-based systems. The second pa- 
per, by Luis Martinez et al., deals with a fuzzy model for RAMSET: Role As- 
signment Methodology for Software Engineering Teams. The third paper, by 
Jorge Soria-Alcaraz et al., describes an academic timetabling design using hyper- 
heuristics. The fourth paper, by Alberto Ochoa et al., describes a logistics optimi- 
zation service improved with artificial intelligence. The fifth paper, by Francisco 
Arce and Mario Garcia- Valdez, describes an accelerometer-based hand gesture 
recognition system using artificial neural networks. 

In the part of intelligent control there are 5 papers that describe different contri- 
butions on achieving control using hybrid intelligent systems based on soft 
computing techniques. The first paper, by leroham Baruch et al., describes a direct 
and indirect neural identification and control of a continuous bioprocess via 
Marquardt learning. The second paper, by Eduardo Gomez-Ramirez et al., deals 
with a method for simple tuning of type-2 fuzzy controllers. The third paper, by 
Leocundo Aguilar et al., proposes an increasing energy efficiency of a preamble 
sampling MAC protocol for wireless sensor networks using a fuzzy logic 
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approach. The fourth paper, by Arnulfo Alanis et al., describes a multi-agent sys- 
tem based on psychological models for mobile robots. The fifth paper, by Fevrier 
Valdez et al., proposes the use of fuzzy logic to control parameters in bio-inspired 
optimization methods. 

In the part of optimization of fuzzy controllers there are 5 papers that describe 
different contributions of new algorithms for optimization and their application to 
designing optimal fuzzy logic controllers. The first paper by Ricardo Martinez 
et al., describes the optimization of type-2 fuzzy logic controllers using PSO ap- 
plied to linear plants. The second paper, by Yazmin Maldonado et al., deals with 
an approach for the optimization of membership functions for an incremental 
fuzzy PD control based on genetic algorithms. The third paper, by Leticia 
Cervantes and Oscar Castillo, describes a new method for the design of a fuzzy 
system for the longitudinal control of an F-14 airplane. The fourth paper by Abra- 
ham Melendez et al., describes a fuzzy reactive controller of a mobile robot. The 
fifth paper, by Arnulfo Alanis et al., describes a multi-agent system with personal- 
ity profiles and preferences and learning for autonomous mobile robot, with fuzzy 
logic support. 

In the part of time series prediction and intelligent agents several contributions 
are described on the development of new models and algorithms relevant to time 
series analysis and forecasting, as well as the application of intelligent agents in 
real-world applications. The first paper, by Pilar Gomez et al., describes compos- 
ite recurrent neural networks for long-term prediction of highly-dynamic time 
series supported by wavelet decomposition. The second paper, by Juan R. Castro 
et al., describes an interval type-2 fuzzy neural network for chaotic time series 
prediction with cross-validation and the Akaike test. The third paper, by Jesus 
Soto et al., deals with chaotic time series prediction using Ensembles of ANFIS. 
The fourth paper, by Lucila Morales et al., describes the modeling of facial ex- 
pression of intelligent virtual agents. The fifth paper, by Ivan Espinoza et al., de- 
scribes agent communication using semantic networks. The sixth paper, by Cecilia 
Leal-Ramirez et al., describes a fuzzy cellular model for predator-prey interaction 
applied to the control of plagues in a peppers cropping. 

In the part of computer vision and robotics several contributions on models and 
algorithms are presented, as well as their applications to different real-world prob- 
lems. The first paper, by Rogelio Salinas-Gutierrez et al., describes the use of 
Gaussian copulas in supervised probabilistic classification. The second paper, by 
Pablo Rivas et al., proposes subjective co-localization analysis with fuzzy predi- 
cates. The third paper, by Jesus David Teran et al., describes an iterated local 
search algorithm for the linear ordering problem with cumulative costs. The fourth 
paper, by Nohe Cazarez et al., describes an observer for the type-1 fuzzy control 
of a servomechanism with backlash using only motor measurements. The fifth pa- 
per, by Selene Cardenas et al., proposes a neuro-fuzzy based output feedback con- 
troller design for biped robot walking. The sixth paper, by Oscar Montiel et al., 
describes a fuzzy system to control the movement of a wheeled mobile robot. The 
seventh paper, by Oscar Montiel et al., proposes an approach for embedding a 
fuzzy locomotion pose controller for a wheeled mobile robot into an FPGA. 



Preface VII 

In conclusion, the edited book comprises papers on diverse aspects of bio- 
inspired models, soft computing and hybrid intelligent systems for control and 
mobile robotics. There are theoretical aspects as well as application papers. 
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Ramon Zatarain 1 , Lucia Barron-Estrada 1 , Carlos Alberto Reyes-Garcia 2 , 
and Orion Fausto Reyes-Galaviz 

1 Instituto Tecnologico de Culiacan, Mexico 

2 Instituto National de Astrofisica, Optica y Electronica (INAOE), Mexico, 

3 Universidad Autonoma de Tlaxcala 
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Abstract. The identification of the best learning style in an Intelligent Tutoring 
System must be considered essential as part of the success in the teaching process. 
This research work presents a set of three different approaches applying intelligent 
systems for automatic identification of learning styles in order to provide an 
adapted learning scheme under different software platforms. The first approach 
uses a neuro-fuzzy network (NFN) to select the best learning style. The second 
approach combines a NFN to classify learning styles with a genetic algorithm for 
weight optimization. The learning styles are based on Gardner's Pedagogical 
Model of Multiple Intelligences. The last approach implements a self-organising 
feature map (SOM) for identifying learning styles under the Felder-Silverman 
Model. The three approaches are used by an author tool for building Intelligent 
Tutoring Systems running under a Web 2.0 collaborative learning platform. The 
tutoring systems together with the neural networks can also be exported to mobile 
devices. We present results of three different tutoring systems produced by three 
implemented authoring tools. 

Keywords: Intelligent Tutoring System, Web 2.0, Authoring Tool, M-Learning. 

1 Introduction 

In an e-learning environment, teachers can design various alternatives of content 
for each possible configuration of learning style [1]. The characteristics of e- 
learning allow teachers to work collaboratively with students, and even with other 
teachers interested in designing learning materials. Thus, responsibility for the de- 
sign of teaching material does not lie with one person, but on a community of 
designers [2]. The time devoted to teaching every student is not a problem in elec- 
tronic learning environments. Each student can have access to materials virtually 
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4 R. Zatarain et al. 

at the time and place he wants [3], An implementation of e-learning, in which 
materials are selected according to student's learning style, can be defined in the con- 
text of an Intelligent Tutoring System (ITS) or in an Adaptive Hypermedia System. 
An ITS refers to any computer system that provides a personalized educational proc- 
ess directly to the student. The ITS works automatically and requires no intervention 
by the teacher (or student) to perform the customization of the learning material. To 
identify the learning style of students there are several solutions. One solution is the 
application of questionnaires designed by teachers specialized in the field. Another 
more accurate solution is to use artificial intelligence techniques for carrying out this 
process automatically. 

This work addresses the above problems through design and construction of a 
neural network, which after training, automatically and dynamically identify stu- 
dent learning styles. This neural network provides the means for customizing 
learning materials. The neural network can be used in any e-learning or m- 
learning environment, whether for identification of learning style in the context of 
an Intelligent Tutoring System, or manually with the student consulting its own 
learning styles. 

The arrangement of the paper is as follows: Section 2 gives a general structure 
of the tool. Section 3 presents the neural network and predictive engine used in 
the tool. Results are shown in Section 4. Discussions and conclusions are given in 
Section 5. 



2 Learning Style Models 

Models of learning styles categorized both the ways in which students learn and 
how teachers teach. Its main objective is that in each category holding the model, 
the learning needs of students are met [4]. Taking into account the majority of ex- 
isting models concerning learning styles, there are five major families [5]. Figure 
1 shows the first four classes of models, which relate to theories of learning styles, 
the fifth family, which is omitted in the figure, contains models that deviate from 
the concept of learning styles and propose other theories. 

The model of learning styles of Gregorc was presented by Anthony Gregorc 
and Kathleen Butler. The objective of the model is to provide an organized struc- 
ture of how the mind works. In this model there are four learning styles: Concrete 
Sequential, Abstract Random, Abstract Sequential and Concrete Random. Howard 
Gardner maintains the idea that every student is equipped with different types of 
intelligence. Gardner's theory is known as Multiple Intelligences, and provides 
that each individual has different intelligences at different levels. Gardner also 
states that every person has a unique cognitive profile. According to multiple 
intelligence theory, there are nine basic types of intelligence: Visual-spatial, 
Verbal-linguistic, Logical-mathematical, Bodily-kinesthetic, Musical-rhythmic, 
Interpersonal, Intrapersonal, Naturalistic, and Existential. The Myers-Briggs 
model is known as the Type Indicator Myers-Briggs (MBTI for short). The MBTI 
identifies four scales in which conforms to all subjects. The scales are extraversion 
/ introversion, sensitive / intuitive, thinking / feeling, and judgment / perception. 
Of all the combinations of the scales, we obtain 16 types of personalities. Each of 
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the combinations of types is described with four letters (one for each scale). The 
Felder-Silverman model was proposed by Richard Felder and Linda Silverman in 
1988 [6]. The model includes four dimensions or categories, two of which repli- 
cate features found in the models of Myers-Briggs and Kolb. The four dimensions 
in the model are related to perception (sensory / intuitive), processing (active / re- 
flective), input presentation (visual / verbal) and understanding (sequential / 
global). Learning styles are obtained by the combination of all categories. Thus, it 
is possible to have 16 different learning styles. 
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Fig. 1. Main Learning Style Models 

3 First Approach: Modeling Learning Styles with a NFN 

The first approach to identify students' learning styles was using a neuro-fuzzy 
network. In this section, we present MLTutor, an authoring tool which can be 
used to build personalized or intelligent tutoring courses to be used in both learn- 
ing settings: distance and mobile learning. The tool can be employed for develop- 
ing learning material using SCORM [7] Learning Objects or other standard file 
formats. The output of the authoring tool will be either learning material for 
mobile devices or SCORM learning objects for e-learning environments. The 
learning material for mobile tools uses a neuro-fuzzy mechanism to identify and to 
predict learning styles in order to provide an adapted learning scheme. The learn- 
ing styles are based on Gardner's Pedagogical Model of Multiple Intelligences [8]. 

3.1 MLTutor Architecture 



Figure 2 presents the architecture of MLTutor. As shown in Figure 2, the tool has 
two main editors: the content editor and the fuzzy set editor. An author creates a tu- 
toring system by first building a course structure using the content editor. This struc- 
ture consists of a number of units where a unit can be linked with learning material 
and some assessments. A course is created by importing already prepared learning 
material in different standard formats like html, pdf, doc or SCORM learning ob- 
jects from any type of source. The author can also introduce learning material by us- 
ing a simple editor included in the tool. Other important learning materials the authors 
insert into each one of the units are quizzes. These can be in every part of each 
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section. The quiz is essential for the dynamic courseware generation because from the 
test results, the neural networks classify learning styles. 

When the author introduces the learning material he/she creates four different in- 
stances corresponding to four different student learning styles (types) according to 
Gardner's Pedagogical Model of Multiple Intelligences: Logical/Mathematical, Ver- 
bal/Linguistic, Visual/Spatial and Musical/Rhythmic. There is a special interface in 
the content editor for helping the author when building this material. 

The fuzzy set editor helps the user to define Fuzzy Membership Functions. For 
Fuzzy Inputs there are seven linguistic variables defined for a user. They are: answer 
selection order, correct answers, quiz spent time, topic spent time, number of tries un- 
til correct answer, number of visits to a question, and number of visits to a topic. 
Each of the seven linguistic variables allows three different values: low, average, and 
high. A Fuzzy inference process is performed mapping input linguistic values to out- 
put multiple-intelligence styles. The output of the fuzzy set editor can be m-learning 
material in XML format along with a predictive engine that employs a neuro-fuzzy 
inference algorithm [9]. It operates online using present and former information for 
each individual learner. At the start of each learning unit, predictions are made as to 
what the learners preferred learning material is, based on former information built in 
the fuzzy set edition phase. Afterward, the neuro-fuzzy algorithm will learn from pre- 
sent information and will make adjustments, if necessary, to another best suited learn- 
ing style. Another option to the output of the fuzzy editor is to export the learning 
material to SCORM format. The benefit of this format is the availability of the mate- 
rial in any distance learning environment. When a mobile course is exported to a mo- 
bile device, a XML interpreter is added to the course. This interpreter has the job of 
displaying the material of the course into the mobile device, according to some 
chosen learning style. 
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FUZZY SET EDITOR 
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Fig. 2. MLTutor Architecture 
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3.2 Neuro-fuzzy Predictive Model 

Figure 3 shows part of the MLTutor Neuro-Fuzzy system (just two linguistic 
variables are shown) implemented in order to represent knowledge (seven linguis- 
tic variables, four multiple intelligences, and inference rules), to learn from former 
and current data, and to make adjustment to new learning styles. 
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Fig. 3. Part of the HLTutor Neuro-Fuzzy System 

As we can observe in Figure 3, the system configuration consists of input and 
output layers, and three hidden layers that represent membership functions and 
fuzzy rules. The complete input layer has seven neurons representing our seven 
linguistic variables. Every linguistic variable has three fuzzy sets (low, average, 
and high). The input layer sends out external crisp values directly to the next layer. 
The output of this layer is as follows: 

Vi = Xi , 

where x, and y ; are input and output respectively. 

Each neuron of the input layer is connected to three neurons on layer 2 (fuzzyfica- 
tion layer). In this layer crisp values from input layer are transformed to 
appropriate linguistic fuzzy sets (low, average, or high sets). Every neuron of 
layer 2 represents a fuzzy set for each one of the seven linguistic variables (see ta- 
ble 1). The output of the layer is the degrees of membership of each input value to 
each fuzzy set. Every neuron of Layer 3 or fuzzy rule layer represents a fuzzy rule 
(Rl, R2, R3, etc.). Every fuzzy rule neuron takes inputs from layer 2 neurons. The 
rule is evaluated by the fuzzy intersection or t-norm, which in this case is the 
product operation. The output of each neuron is described as: 



Yi — x li X x 2i X ■•• X x ki 



8 R. Zatarain et al. 

The output of layer 3 represents the weights of each one of the rules. The 
weights connecting layer 3 and layer 4 are changed or adjusted by training the 
neural network. The weight values are normalized on many adjustments, by divid- 
ing each weight into the greatest weight found on each one of the adjustments or 
iterations and is represented by 

Wni(p + 1) = Wj(p) / W^p), 

where w ni is the normalized weight, p is the last iteration, Wj is the weight of 
connection i, and w max is the greatest weight of the iteration. 

Table 1. Fuzzy sets for two Linguistic Variables 



TopicsSpentTime 


CorrectAnswers 


Range=[0 1000] 

NumMFs=5 

MFl='Low':'trapmf ,[-225-25 100 300] 

MF2='Medium-Low':'trimf ,[100 300 500] 

MF3='Medium':'trimf ,[300 500 700] 

MF4='Medium-High':'trimf ,[500 700 900] 

MF5='High':'trapmf ,[700 900 1025 1225] 


Range=[0 10] 
NumMFs=5 

MFl='Low':'trapmf,[-2.25 -0.25 1 3] 
MF2='Medium-Low': , trimf ,[1 3 5] 
MF3='Medium':'trimf ,[3 5 7] 
MF4= , Medium-High':'trimf ,[5 7 9] 
MF5='High':'trapmf,[7 9 10.2 12.2] 



Layer 4 or output membership layer takes inputs from the fuzzy rule neurons 
and merges them by using fuzzy union, which in our case is the algebraic sum and 
is defined as 



x li © x 2i 



ffi Xjj 



The outputs of layer 4 are the different learning styles (Logical/Mathematical, 
Verbal/Linguistic, Visual/Spatial and Musical/Rhythmic) produced in the fuzzy 
rule layer. 

Layer 5 or defuzzyfication layer is the output of our system. In this layer the 
sum-product composition method is used [10]. The output of this last layer is a 
recommended learning style for the student. The learning algorithm we used is 
back-propagation. Our algorithm takes a desired output (a default learning style), 
which is computed at the beginning of the iterations. Next, the algorithm computes 
the actual output and compares it with the desired output (which is dynamically 
adjusted according with the student test results). If there is a difference between 
the actual and the desired output, the error is calculated and propagated backward 
through the network, updating or adjusting the weights between connected neu- 
rons. The neural network was trained too using Matlab (version 7.1). Then, 
the trained neural network was implemented using Java along with the XML 
interpreter. 

3.3 Tests and Results 



We conducted a test of our tool by working with a group of different kinds of 
users (authors), mainly university professors, and college students. They produced 
personalized or intelligent tutoring courses for English Language (Elementary 
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school), Object-Oriented Analysis and Design, and a Basic Math Course. The 
courses dynamically adapt the content or style of instruction presented to the 
learner. Figure 4 shows some interfaces (snapshots) of the authoring tool when 
building the Basic Math course. As we can observe, one image shows the course 
tree structure (left-top); two images show the contents of the course under the 
Verbal/Linguistic type of Intelligence (right-top and left-bottom); another one 
illustrates the fuzzy set edition (right-bottom); and the last one (front image) 
illustrates a quiz edition. 
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Fig. 4. Interfaces of MLTutor and a Math Course 



Figure 5 presents some instances of the Math course running in a cell Phone 
(NHAL Win32 Emulator). The four mobile instances correspond (left-to-right, 
top-bottom) to Three-Chapter menu, Chapter-1 menu (contents and quiz), 
chapter-1 contents, and chapter-1 quiz. 
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Fig. 5. The Math Course in a Cell Phone 

4 Second Approach: A NFN and a Genetic Algorithm 

The second approach to identify students' learning styles was using a neuro-fuzzy 
network together with a Genetic Algorithm. In this section, we present EDUCA, a 
Web 2.0 software tool to allow a community of authors and learners to create, 
share, and view learning materials and web resources in an adaptive environment 
which combines collaborative, mobile and e-learning methods. EDUCA applies 
different artificial intelligence techniques like a neural network and a genetic algo- 
rithm for selecting the best learning style or a recommendation-web mining 
system for adding and searching new learning resources. 

4.1 EDUCA Architecture 



Figure 6 illustrates the overall architecture of EDUCA. As we can observe, there 
are two authors: the main tutor (a teacher or instructor) and the community of 
learners. The student or learner is an important author of the course and 
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participate actively adding learning resources to the courses. The learner has a 
user profile with information like the GPA, particular learning style, or recom- 
mended resources to the course. When the authors add learning material, they first 
create four different instances corresponding to four different learning styles 
according to Felder-Silverman Learner Style Model [6]. 

We implemented a fuzzy-neural network using the fuzzy input values previ- 
ously defined. The output of the network is the learning style for each student us- 
ing a course. We also implemented a genetic algorithm for the optimization of 
the weights used in the network. The network was trained for 800 generations us- 
ing a population of 150 chromosomes. In order to train the network, we created 
three set of courses for high school students. Each course was presented in four 
different teaching styles according to the Felder-Silverman model. When a mobile 
course is exported to a mobile device, a XML interpreter is added to the course. 
A SCORM file for the course can also be exported. 



EDUCA 
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Fig. 6. EDUCA Architecture 

Once a course is created, a Course Publication Module saves it into a Course Re- 
pository. Whenever a learner accesses a course, a recommender system implemented 
in EDUCA presents links or Web sites with learning material related to the current 
topic. Such material is stored in a resource repository of EDUCA, which was 
searched previously by using Web mining techniques implemented also in EDUCA. 

4.2 The Intelligent Module 



The Intelligent Module takes as an input the learning material for four different 
learning styles. Then, it creates a NFN used to classify the learning style of the 
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user/student and produces as an output, an adaptive course (a special type of 
Intelligent Tutoring System). An adaptive course is structured with two compo- 
nents: A XML file (it contains the learning material), and the XML Interpreter. 
The Interpreter uses the NFN as a dynamic classifier to show the learning material 
according to the best learning style of the user. 

The first layer of the NFN has 7 neurons (see figure 3) representing the 7 lin- 
guistic variables used in the classification of the learning style. Every neuron of 
layer 1 is connected to 3 neurons of layer 2 (fuzzyfication layer). Due to the fact 
that we are using triangular membership functions, the activation function for the 
layer 2 neurons is as follow: 

y i (1) = 0,ifx I (1) <a-b/2 

y/ 1 * =1-2 Ixj (1) - al / b, if a - b / 2 < x/ < a + b / 2 

y i (1) = 0,ifx I (1) >a + b/2 

where a and b are the centre and width of the triangle, xf * and v/ 7-1 are input and 
output of neuron i respectively. 

The output of layer 3 represents the strength of each one of the fuzzy rules. The 
best weight values between layer 3 and layer 4 are calculated using a genetic algo- 
rithm. Layer 5 is the output of the NFN. We applied a Centroid technique to make 
defuzzyfication. The value of the output is the learning style for the current student 
of the course. 

Training the NFN with a Genetic Algorithm 




At the beginning, we create a chromosome population with random values. 
Each decoded element of the population, represent the weights in the NFN (see 
figure 7). 



-0.6 0.5 -0.3 0.14 0.3 -0.4 0.63 -0.21 0.8 0.9 -0.1 



Fig. 7. Decoding the weights in the NFN into a chromosome 

For encoding the chromosome, the weights of the NFN are sorted using a 
Bucket Sort Algorithm, ordering first by layer, and then by neurons. The output of 
this algorithm is a chromosome. Each gene of the chromosome represents a 
weighted link in the NFN. 

To evaluate the chromosome's performance, we assign each weight (gene) con- 
tained in the chromosome to the links of a respective NFN. Then we test the 
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network with a set of training values. Last, the sum of squared errors is evaluated, 
which will be used as the fitness value of the tested chromosome. 

In order to train the network, we create three set of courses for high school stu- 
dents. Each course was presented in eight different styles for Felder-Silverman 
model. The chosen courses were Teaching Digital Photography, Eolic Energy, 
and Introduction to Computers. In order to get their best learning style we applied 
Felder-Silverman Index of Learning Style Questionnaire to every student. We also 
designed an exam to evaluate the course taken by the student. The input data to the 
network was the performance of the student under each course and the learning 
style of the taken course (randomly chosen). The Desired Output was the learning 
style calculated from Felder-Silverman Questionnaire. We made tests with two 
groups of 40 students. 

4.3 Using the NFN on Cell Phones 

Whenever a course is created and exported to a cell phone or PDA (using XML 
format), the NFN is also exported to those devices, along with a XML parser or 
interpreter. This program run any course stored in the mobile by reading the XML 
file where the course is stored. At the beginning, a learning style is assigned to the 
student or user of the course. Then, depending of the results from questions pre- 
pared to the students inside the learning objects, the learning material is adapted to 
the best learning style (Visual, Verbal, Sequential, etc.) of the student (the NFN is 
consulted in order to find the best appropriated learning style to the student). 
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Fig. 8. Dynamic sequencing of Learning Material 



An important feature is the possibility of tracing the different learning styles 
that a user or student follow during lesson learning. The interpreter of the course 
optionally displays the current student learning styles plus the values of fuzzy 
variables. This information is relevant for doing different types of analysis with 
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respect to behaviors of the students and the way they learn. Figure 8 shows two 
snapshots of a handheld device (a cell phone) showing values of each learning 
style in a range between -100 to +100 (left device) and a dynamic sequence of the 
style behavior during the execution of the course (right device). 
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Fig. 9. Creation of Learning Material for a Basic Math course 



4.4 Test and Results 



We tested the tool with 15 professors/teachers and their respective students of dif- 
ferent teaching levels. They developed different kinds of courses like a 
GNU/Linux course, a Basic Math Operation course, and learning material for 
preparation to the University Admission test EXANI-II. Each one of the courses 
had from 4 to 10 units of learning. They included evaluations on each unit. The 
students participated by reading, evaluating and adding material (Web resources) 
to the courses. We applied a questionnaire or survey to instructors and students in 
order to evaluate the effectiveness of EDUCA. More than 90% of them agree that 
they would like to use the tool for teaching their courses. This point is very impor- 
tant because the tool was developed to be used by instructors of any level of 
teaching (from elementary to college level). 
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Next, we present an example of how an author creates/updates learning 
material for a Basic Math course (figure 9). We first create the structure of the 
course (left-top). Then, we add learning material for each learning style (right- top 
and left-bottom). In this stage, we also assign fuzzy set values to each linguistic 
variable, and use recommended and actual resources for inclusion in the course. 
Last, we export and display the course (right-bottom). 

5 Third Approach: A Self-Organizing Map 

In this section, we present a new version of the authoring tool EDUCA using a 
self-organizing map or Kohonen network [11] to identify students' learning styles. 
Advantages of Kohonen networks include implementation simplicity, execution 
speed and a shorter training process; however maybe the most important advan- 
tage of these unsupervised neural networks is that they do not require an external 
teacher for presenting a training set. During a training session, the SOM receives a 
number of different input patterns (the student learning style obtained from the 
ILSQ, the course learning style, and the student's grade in the course), discovers 
significant features in these patterns (Felder-Silverman learning styles) and learns 
how to classify input. 

5.1 New EDUCA Architecture 

Figure 10 presents the new architecture of EDUCA. The most important modifica- 
tion in this new version is the SOM which substitutes the NFN used in the first 
version. The functionalities in this new authoring tool are very similar for the user 
with only a new editor for the creation of learning material. After the author cre- 
ates a course or intelligent tutoring system, she/he can save it and export it to a 
Mobile Learning format used to display tutoring systems on mobile devices. The 
saved/exported file will enclose three elements: a XML file corresponding to the 
learning style model, a predictive engine for navigation purposes, and the 
Kohonen Neural Network for learning style classification. 

Once a Tutoring System has been created, the module Published Course stores 
it in a Course Repository. We know that learners usually read tutoring systems 
stored in some kind of repository, but they also consult other learning resources in 
different web sites. For example a student who needs to implement a LR parser in 
a compiler course, could use the Tutoring System stored in the course repository, 
but she/he could also consult extra resources in the Web. This material found by 
the student would be rated and recommended to be added to the regular compiler 
course. EDUCA uses a Hybrid Recommendation System [12] which stores new 
resources into a Resource Repository. Last, EDUCA uses a Data or Text Mining 
Subsystem used to search resources in the Web. 
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Fig. 10. New Architecture of EDIT A 

5.2 Training the SOM 

An input vector in our neural network is described as: 

D = [d fs d a p] 

where D is the input vector, and it is formed by three elements: cL which is the 
student learning style identified by applying the Felder-Silverman ILSQ question- 
naire; d a which is the learning style used in the learning material read by the stu- 
dent; and p is the student grade obtained in the test when the student has learning 
style dfs and the course was offered using learning style d a 
Vectors df s and da are also composed as: 

d a = d fs = [c, C 2 C 3 ] 

where Cj c 2 c 3 represents three scores for Perception, Input, and Understanding 
Dimension in the ILSQ questionnaire. 

Figure 1 1 shows part of the Kohonen network architecture. We can see that the 
input layer has seven neurons, corresponding to three-dimension vectors dp and d a , 
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and the student grade p. Vector's input data vary between -11 to +11, which is 
massaged to the range -1 to +1 with the equation: 

Massaged value u = vi / Ivimaxl 

where vi is a ILSQ score, and vimax is the maximum ILSQ score value (+11). As 
we observe in figure 11, -3, an ILSQ score for visual/verbal learning style, is 
mapped to -0.272. On the other hand, the student grade, which varies between 
and 100, is also massaged to the range -1 to +1, with equation: 

vnj = vj x 2 / yjmax - 1 

where vj is the student grade or score, and vjmax is the maximum student grade 
(+100). For example, 70 is mapped to 0.4. 

The Kohonen layer has 1600 neurons arranged in a structure of a 40x40 hex- 
agonal grid. The output of the network consists of three signals or values ranging 
between -1 to +1. These data are then decoded to the ILSQ range (-11 to +11); 
they represent the learning style of the student. 
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Fig. 11. The Kohonen Network for Learning Styles 



The procedure to train the neural network consisted in three steps. The sample 
for this experiment was 140 high-school students. During the first step we applied 
the ILSQ questionnaire in order to identify the learning style of each student. In 
the second step we created three different introductory courses: Computer 
Science, Photography, and Eolithic Energy. For each course we produced eight 
versions, one for each learning style (visual/verbal, sequential/global, sens- 
ing/intuitive, and active/reflective). Randomly, the students received each of the 
three courses, in only one learning style. We recorded the assigned learning styles 
of each student in the three courses. In the third step and after the students took the 
courses, we applied tests in order to evaluate the performance of the students in 
each course. 
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With these three elements (student learning styles, course learning styles, and 
student evaluation or results) we created 140 input vectors for training the net- 
work. We trained the network with 6000 iterations, an initial neighbourhood ratio 
of 50% of the lattice size and a learning rate of 0.1. The network has 1600 neurons 
arranged in the form of a two-dimensional lattice with 40 rows and 40 columns, 
and every input vector has seven elements, as we established previously. 

5.3 Using the Kohonen Network 

As we explained before the network is applied to identify learning styles of stu- 
dents. The network is then exported together with the ITS and the predictive en- 
gine to the mobile (see figure 1). When the student is displaying an ITS in a time t, 
the learning material shown to the students is using a learning style g. Whenever 
the student answers a quiz as part of the learning contents, he receives a grade k. 

Therefore, the network r takes as input the learning style of the displayed con- 
tents, and performs a search to find the winner-takes-all (best-matching) neuron j x 
at iteration/?, using the minimum-distance Euclidean criterion: 



j x (p) = minllX-W J (p)ll=[S(x 1 



■w„)Y 2 



where n is the number of neurons in the input layer, m is the number of neurons in 
the Kohonen layer, X is the input pattern vector and W is the weight vector. 

After, finding neuron j x , the output of the network is the student learning style 
which is defined as 

g(t+l) = r(g(t),k) 

This process of identifying the student learning style is performed every time the 
student answers a quiz on the ITS. 
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Fig. 12. EDUCA Main Interface 
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5.4 Tests and Results 

The tool was used by a group of different users (authors), which consisted mainly of 
university professors/students. They produced different intelligent tutoring systems 
like Compiler Constructions, Introduction to Computer Science, teaching the Maya 
Language, and an introduction to Elementary mathematics. The intelligent systems 
were initially created by a professor and later the students were using and adding new 
learning resources with the recommendation system. Figure 12 shows the main inter- 
face of EDUCA for lecture 3 "Parsing" in a Compiler course. As we can observe, 
there are different kinds of learning objects, depending of the learning style, added to 
the topic. In our example, the arrow is pointing to a learning object which was added 
to be displayed only for "visual/sensitive/sequential/reflective" learners. The other 
objects are also displayed for different learning style combinations. 

Figure 13 presents three snapshots of the Compiler course running in a mobile 
phone (Sun Java Wireless Toolkit 2.5.2). The first mobile phone (left) shows 
learning material about the parsing topic. The second and third mobile phone 
gives us a trace of the students' learning styles (in three stages) along the course. 
This information is valuable in that it shows how students' learning styles are 
varying from the one is obtained with the ILSQ to the correct one which is 
obtained with help of the Kohonen Network. 



• 


e 


• 


V 


~ 


ft 


41 




1 


2 ABC 


2du 


4gw 


Sl«L 


iB '"• 


7pob 


Stuy 


9WXTI 


*• 





#- + 




• 





1 • 


t 


1 " 


ft 


< 




1 


2 IK 


3dm 




4&HI 


5 1« 


{jamo 


7pok 


8"« 


1 9 wm 


*• 





i »- + ) 


Ml F" 







* 


M 


• 


t 


- 


A 


<l 




1 


2» K 


3° Ef 


4«i 


j 5» 


i 9 mn ° 


7pqk 


] 8tuv 


9 ware 


*' 


i o 


#■♦ 



Fig. 13. The compiler course in a Cell Phone 
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6 Related Work 

Many approaches and implementations have been developed in recent years in 
order to model students' learning styles [13, 14, 15, 16]. Most of these approaches 
use Bayesian Networks, Linear Temporal Logic, or Neuro-fuzzy Networks. Learn- 
ing styles are calculated based only on the ILSQ questionnaire and none of these 
works are authoring tools. 

Since the beginning of research in the field of ITS, more than twenty authoring 
systems for ITS have been developed [17]. Those authoring tools can be classified 
according to the type of tutoring system they produce; for example the author tool 
SIMQUEST [18] produces "simulation-based learning" systems, IRIS [19] creates 
"multiple knowledge types" systems, and InterBook [20] generates Intelli- 
gent/adaptive Hypermedia. One shared feature of all of those tools is the separate role 
of authors and learners. For instance, in SIMQUEST one author creates a simulation 
model, learner interface, instructional design, and environment. In this case, the 
learner is never involved. With IRIS, the author produces an ITS by following two 
main phases: Specifying the general requirements of the tutoring system, and filling 
the learning contents of the ITS. Building adaptive educational material is a similar 
procedure: the developer has to create content objects and specify the links between 
them. Authoring tools for developing adaptive educational/tutoring systems like In- 
terBook, SIGUE [21], NetCoach [22], and Ale [23] differentiate only by the interface 
tools the authors operate. They use special markup languages or GUI interfaces. The 
author creates the learning environments and the learners read the learning contents. 

There are several authoring tools used to create mobile applications like 
MyLearning [24], Test Editor [25], or mediaBoard [26]. Some of them are more 
PocketPC's oriented and some are focused to quiz editing or game-based learning. 
None of these author tools however, have the ability to adapt to the user's learning 
style, nor have portability across different computer and operating system 
platforms. 

7 Conclusions and Future Work 

This paper presents different approaches for modeling Students' Learning Styles 
for two different models and different platforms of learning. The approaches were 
implemented under three different authoring tools that allow the production of 
personalized learning material (Intelligent Tutoring Systems) to be used under 
web-based and mobile learning environments. The software tools were imple- 
mented with Java version 1.6 and XML. The courses or Tutoring Systems pro- 
duced with the tools are platform independent and have been tested with different 
software emulators (NHAL Win32 Emulator, J2ME Wireless Toolkit and Sony 
Ericsson SDK), cell and smart phones (Sony Ericsson and Nokia). We present 
tests with some of those emulators. Currently empirical studies are taking place to 
examine the students' reaction to the learning material produced using EDUCA. 
Future work should focus on continuing testing with official and complete mate- 
rial for course programs in public and private schools in Mexico. 
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Abstract. Current studies related to performance and integration of Software 
Engineering teams, focus on efficiency boarding different factors such as team 
abilities, management, tools, methodologies, personality and roles. The present 
work is a follow up of RAMSET (Role Assignment Methodology for Software 
Engineering Teams) methodology that relates personality, abilities and software 
roles for integration of Software Engineering Teams, applying sociometric and 
psychometric techniques under a Fuzzy Approach due measurements' subjective 
appreciation. A Fuzzy Model System is proposed to identify the best Role to be 
performed by a Software Engineer in Software Development Project Teams. 

Keywords: Fuzzy Logic, Software Engineering, Psychometrics, Sociometrics. 

1 Introduction 

Software development is an impressively complex socio-technical activity. It 
requires people to interact with each other and with technical methods and com- 
puting technologies they use to perform their work [1]. We must integrate a group 
of persons taking into account abilities and capacities for software development 
process. Each member contributes in a team as part of the total necessary capaci- 
ties for team success, which is why each individual must do activities where they 
are most experienced in performing a specific role in the work team. 

In a working team it is not only important to know capacity, skills, techniques and 
experience of each member, it is also fundamental to know the sociopsychological 
characteristics and personality of each one to form the team [28]. 

Achieving and maintaining excellence in modern corporations requires an ef- 
fort of combining talents from different individuals (strengths and weaknesses) to 
accomplish structure, integration and consolidation of working teams. 

This paper introduces a technique that relates personality, abilities and team 
roles in a software developing team. RAMSET (Role Assignment Methodology for 



O. Castillo et al. (Eds.): Soft Computing for Intell. Control and Mob. Robot., SCI 318, pp. 23 
springerlink.com © Springer-Verlag Berlin Heidelberg 201U 



24 L.G. Martinez et al. 

Software Engineering Teams) is based in personality tests with a Fuzzy Model Sys- 
tem approach in role assignment for software engineering teams. Personality tests 
are based on interpretation; therefore to tackle uncertainty a fuzzy model can bring 
us a more appropriate decision for recommending and assigning a software role. 

The paper is organized as follows: section 2 introduces fundamental aspects of 
personality and software engineering roles in working teams with background re- 
lated with the projective personality Tree Test. Section 3 describes RAMSET steps, 
section 4 a Fuzzy Model System is proposed for the personality Tree Test in par- 
ticular. Section 5 shows the results of our work and relationships between linguistic 
variables of the proposed model. Finally section 6 concludes with observations and 
experiences obtained. 

2 Background 

2.1 Personality and Role Assignment Relationship 

Human Psychology studies are old, before the 80's personality aspects were con- 
sidered of low value in personnel selection. Personnel selection methodologically 
chooses individuals for the right job. The first tests [24] used for a programmer's 
performance prediction and personnel selection in software development projects 
where PAT test (Programmer Aptitude Test), WPT test (Wonderlic Personnel Test) 
and PMA test (Primary Mental Abilities). In personality analysis, different and 
popular tests exist, like Jung's, Keirsey and Myers-Briggs Type Indicator (MBTI) 
tests. Rutherford [29] manages Personality Inventories in his Software Engineering 
classes to integrate heterogeneous teams, obtaining good results in teams' perform- 
ance and individuals' growth. Karn and Cowling [17] also implemented personality 
tests in Sheffield University England, documenting personality effects in Software 
Engineering Teams performance, mentioning that is a vast area and many subjects 
to consider relating software engineer's personality and their teams. 

Empirical studies investigating integration and building of Software Engineer- 
ing Teams focus in different factors that influence their performance like team ab- 
ilities, team administration, efficiency, development methods, diversity, size, 
genre, personality [11] and roles [7]. Pieterse and Kourie [27] reflect on the per- 
formance and diversity of Software Engineering Teams from the University of 
Pretoria where they applied personality tests, establishing metrics to build the 
teams. These metrics were ability, diversity and performance, concluding that not 
only diversity must be taken into account but also effective leadership, communi- 
cation and cohesion. Hogan and Thomas [13] emphasize about team work in their 
Software Engineering Courses where they have helped students to be better 
professionals teaching them work team abilities. Beranek [3] has been defining 
Functional Team Roles examining an informal distribution of team roles between 
students, whereas they measure their abilities and attitudes, working styles and 
behavior, based on Belbin [4] Role Tests, complementing them with students 
opinion's about their preferences and working attitudes. Feldt et al. [8] corroborate 
that empirical studies should collect psychometrics focusing in correlating 
personality ant attitudes to software engineering processes and tools. 
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Personality type was measured with MBTI (Myers-Briggs Type Indicator) 
using Keirsey Temperament Sorter by Gorla and Lam [11], their survey study 
shows a relationship between Software Engineering Roles and MBTI dimensions, 
primarily analyzing team leaders, analysts and programmers. These experiences 
resemble the findings by Capretz [5] in surveying people in the US working in 
software engineering. 

Table 1, Personality and Role Relationship by Gorla and Lam 



Criterion 


Personality dimension 


Team performance 


Team leader 


Information gathering 


Intuitive > Sensing 


Decision making 


Feeling > Thinking 


System analyst 


Decision making 


Thinking > Feeling 


Programmer 


Interaction with the 
world 


Extrovert > Intro- 
vert 



Shen [30] and colleagues have applied Myers-Briggs Type Indicator (MBTI) 
and Kiersey Temperament Sorter to form engineering design teams based on 
Wilde's [35] own method of selection with this tests. They recommend a Sensing- 
Intuitive (SN) type as the most important preferred type linked to creativity for se- 
lection of engineers in a leader role for design process. Also Sodiya et al. [31] 
from Nigeria have developed a tool integrating personality traits proposing a Cog- 
no-Personality Assessment Model for Software Engineering (CPAMSE) relating 
the Big Five Traits with roles and activities of engineers in software engineering. 
Fuzzy based approaches have been considered like Lather's [20] fuzzy model to 
evaluate suitability of Software Developers, also Ghasem-Aghaee and Oren's 
[25] [26] use of fuzzy logic to represent personality for human behavior simula- 
tion. Consequently everyone encouraging engineering educators to make greater 
use of type theory when selecting and forming engineering design teams and dele- 
gating team roles, in benefit of achieving productivity and efficiency in team 
performance, themes open to investigation. 

2.2 Graphology Tree Test 

In psychology the Tree Test is a projective test or graphical test. Emil Jucker [18] 
initiated the legacy of this psychodiagnostic test. Later Karl Koch [19] contributes 
Tree Test formulations, interpretations related in graphology as quality strokes, 
zone and sheet placement. Every graphical product instates the psychic life of an 
individual. Jung's tree symbology makes us consider it as a life symbol, of growth 
and development. He states "the growth from bottom up and viceversa, the mater- 
nal aspect: protection, shadow, cover, fruits, and fountain of life" [14]. Carl G. 
Jung also stated "In the unconscious leaving context, the trees are like fish in the 
water" [15]. Thus trees are symbolically associated with man's inner person and 
can describe their growth and emotional stability. 
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Fig. 1. EGO relationships expressed in Tree 



The tree expresses the relationship between Id, Ego and Super-Ego. The Id 
comprises the unorganized part of personality structure that contains the basic 
drives, everything that is inherited, that is present at birth, therefore, instincts 
which originate from the somatic organization, and which find a first psychical 
expression here (in the id) in forms unknown to us [9]. The Ego comprises the or- 
ganized part of personality structure that includes defensive, perceptual, intellec- 
tual-cognitive, and executive functions. The Ego is that part of the Id which has 
been modified by direct influence of the external world. The Ego mediates among 
the Id, the Super-Ego and the external world. Its task is to find a balance between 
primitive drives and reality while satisfying the id and super-ego. Its main concern 
is with the individual's safety and allows some of the id's desires to be expressed, 
but only when consequences of these actions are marginal. The Super-ego aims 
for perfection. It comprises that organized part of the personality structure, mainly 
but not entirely unconscious, that includes the individual's ego ideals, spiritual 
goals, and psychic agency (commonly called 'conscience') that criticizes and 
prohibits his or her drives, fantasies, feelings, and actions. 

Perfect equilibrium of this personality instances assures a psychic stability 
while their disproportion suggests a pathology appearance. The Tree Test presents 
an observation type evaluation of the Id strength level, emotional stability level 
and autoevaluation degree. Id strength level permits the subject reality assertion, 
persevere in his objectives and goals, to overcome and resist pressures and frustra- 
tions of his surroundings. Emotional stability's level is the presence and degree of 
conflicts associated to susceptibility, vulnerability, sensibility, rigidness, adapta- 
bility, and so on. Autoevaluation degree refers to how the subject finds itself 
in that moment, his reality criteria, strength, and ability to control impulses and 
emotions. 

The tree's crown represents the subject's fantasies, mental activities, his 
thoughts, spirituality and reality conception, it covers foliage and branches. The 
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unconscious world or instincts are symbolized in the root. Renate Griffiths [12] 
states "the root tree is the origin of it, is the symbol of its fountain of life". Finding 
the roots on the third or inferior zone, it tells us of the material, physical, earthly 
life, sexuality, and basis for his reality criteria. 

A personality Tree Test does not define us a total projection or whole image of 
the personality, but sheds valuable information of the sketcher. Relevance and me- 
rit of applying this test is a combination with other tests, which is why is just one 
of many tests used in RAMSET to reveal personalities of working team members 
and assign them the best advisable performing role. 

2.3 Fuzzy Model System 

Fuzzy sets and fuzzy logic are effective techniques for handling fuzzy uncertain- 
ties with well-developed mathematical properties. Imprecise concepts are attrib- 
utes of which we have a cognitive perception, yet are impossible to define 
precisely like in a defined model. That is why fuzzy logic provides an excellent 
way to represent and process linguistic variables. 

A non-fuzzy method weakness is dealing with imprecision and uncertainty in 
handling linguistic terms. Fuzzy set theory provides a natural method for dealing 
with the linguistic terms by which an expert will describe a domain [16]. It is an 
established fact that fuzzy model is the best choice for managing ambiguous, 
doubtful, contradicting and diverging opinions. All the way it's a better choice for 
very high complexity and nonlinearities. Fuzzy systems are intuitive and numeric 
systems, mapping input and output data. 

Fuzzy Inference Systems (FISs) also called Fuzzy Models are fuzzy production 
systems used for modeling input-output relationships. From this input-output 
view, Babuska [2] describes these systems as "flexible mathematical functions 
which can approximate other functions or just data (measurements) with a desired 
accuracy". Fuzzy Production Rules define the relationship between input and out- 
put variables. Input variables are defined in the antecedent part of the rule and the 
consequent part defines the output variable. 

Figure 2 represents these FIS that are basically composed of five modules: 

1) Rule Base: the set of fuzzy production rules. 

2) Database: where the membership functions are defined. 

3) Fuzzy Inference Engine: executes the fuzzy inference operations. 

4) Fuzzifier: transforms the inputs-numerical data into linguistic values. 

5) Defuzzifier: transforms the fuzzy results into numerical data. 
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Fig. 2. Fuzzy Model System Structure 
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A fuzzy system is associated with a set of rules with meaningful linguistic 
variables, such as (1) 

R : if X\ is F/ and x 2 is F 2 and . . . x n is F n then wis G 1 (1) 

Actions are combined with rules in antecedent/consequent format, and then aggre- 
gated according to approximate reasoning theory, to produce a nonlinear mapping 
form input space U = Uj x U 2 x U3 x ... x U n to the output space W where F^ , k = 
1,2, ..., n are the antecedent membership functions, and G ' is the consequent 
membership function. Input linguistic variables are denoted by Uk , k = 1,2, ..., n , 
and the output linguistic variable is denoted by w. 

The most resorted fuzzy inference models are those of Mamdani and 
Takagi-Sugeno [32]. Mamdani is direct and simple in describing empirical knowl- 
edge, has clarity in significance of linguistic variables and design parameters. 
Takagi-Sugeno enhances a simpler process using first degree equations in most of 
its applications, at a cost of less clarity in linguistic variables significance. Mam- 
dani fuzzy rules take the form (2), where x and y are activated variables of the 
membership function, z is the consequent fuzzy variable and the connective AND 
the conjunction operation with the antecedent. 

IF x is Xo AND y is Yo THEN z is Zo (2) 

It is in this context we present here a fuzzy based approach that provides an inte- 
grated quantity measure for abilities of software development personnel which 
incorporates all aspects of personality traits involved for role assignment. 



3 RAMSET Methodology 

In Software Engineering Courses development of projects by work teams is usu- 
ally implemented applying the traditional Software Life Cycle. Classic software 
engineering courses consisting of lecture notes and Waterfall Model application 
for project development is being substituted by more practical methods emphasiz- 
ing in helpful tools and agile processes, as divulged by North Carolina State 
University [21]. 

At the University of Baja California, Tijuana Mexico teaching of Software 
Engineering in our Computer Engineering Program is being conducted with de- 
velopment of real software projects applying RAMSET: Role Assignment Meth- 
odology for Software Engineering Teams based on personality. What is unique 
about RAMSET is a combination of Sociometric techniques, Psychometrics and 
Role Theory in Software Engineering Development Projects, this methodology 
consists of the next steps: 

a) Survey for abilities and skills. 

b) Implementation of Personality Tests. 

c) Execute Personal Interviews. 

d) Implementation of Sociometric Technique. 

e) Assignment of Team Roles. 

f) Follow up of Team Role fulfillment 
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RAMSET [22] methodology begins with a student's survey enumerating related 
courses of software engineering he has taken, to know which programming lan- 
guages and data base managers he is expert in. The next step is a series of person- 
ality tests; they could be Jung, Myers-Briggs, Big Five, Keirsey, Tree Test or 
similar tests. 

Subsequently we make an informal interview to know different aspects of his 
personality, what he likes to do, how he perceives himself at the end of his career 
studies, how he develops in the real world individually and with others, how he 
would like to participate in a team. After that a sociogram technique is applied to 
identify affinity for integration of teams. Based on test results and interview in- 
formation a team role is recommended to the instructor so individual members of 
each team develop a specific team role with all its functions. Figure 3 is an exam- 
ple of 16 participants that formed 6 different teams and worked on a real software 
project [23]. 

Jung Test's different personality dimensions are associated to different career 
and jobs corresponding to individual personality types [34]. Trait (E) denotes 
Extraversion, (I) Introversion, (S) Sensitive, (N) Intuitive, (T) Thought, (F) Senti- 
ment, (J) Judging and (P) Perceiving. Jobs associated with engineering and soft- 
ware are: ISTJ type for jobs like engineer, programmer, chief information officer; 
INTJ type includes programmer and informational graphics; INTP type software 
designer, systems analyst, computer programmer, data base manager; ISTP type 
includes computer repair person, computer programmer, software developer; 
ESTP takes in entrepreneur, technical trainer, analyst; ENTJ an administrator and 
program designer. 

Aim of student's survey for abilities and skills is to obtain homogeneity, diver- 
sification and efficiency of the team. Teams formed in Figure 3 Case Study were 
based on Moreno's sociogram technique and tried to have an ESTJ personality 
member in each team as marked by red arrows, role recommended for this ESTJ 
personality is an Analyst. 



ESTJ 




UD-0H© © 







Fig. 3. Team Distribution Case Study Example 



Team roles adopted were selected from team roles defined by Tomayko [33]: 
architect, responsible for project creation, coordination and supervision; analyst, 
responsible for finding and following up on resources, requirement analysis and 
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specifications; developer-programmer, responsible for implementation and code 
design; tester, responsible for tests and evaluation of the system; document spe- 
cialist, responsible for compiling every document for evidence and defining do- 
cumentation standards; and image and presentation role as a representative in 
charge of selling and promoting the product. 

We can choose from a set of different Personality Tests all revolving on the 
dimension types or personality traits that display the behavior of the individual 
member of the team. Combining these tests we can acquire the most valuable 
information for decision making in assignment of roles. 

4 Tree Fuzzy Model System 

Personality's Tree Test throws subjective information, based on the point of view 
and perception of the evaluator. That is why we propose a Fuzzy Model System 
that can shed numerical values that solve the output uncertainty. Fuzzy Inference 
Systems are based on Fuzzy Set Theory [36] permitting the incorporation of an 
uncertainty component that makes them more effective for reality approximation. 
Linguistic variables are used to manipulate imprecise qualitative and quantitative 
information; the linguistic variable is a variable whose values are not numbers but 
words or sentences in a natural or artificial language [6]. 

A linguistic variable is characterized by a quintuple (x, T(x), U, G, M), in 
which x stands for the name of the variable, T(x) denotes the set of x of fuzzy va- 
riable values, ranging over a universe of discourse U. G is a syntactic rule for gen- 
erating names of x, and M is a semantic rule for associating each x to its meaning 
being a subset of U. 

For our Tree Test FIS we selected three linguistic variables: (R) Root, (T) 
Trunk and (F) Foliage. These input variables enter the FIS model and obtain an 
output variable result of the Role (Q). Each linguistic variable is introduced to a 
filter of sketch recognition identifying the variable's input value for the FIS's 
Role. Therefore the model has three input and one output variables as seen in 
figure 4. 



R- 

ROOT 



TRUNK 



> FIS 



-> Q 



ROLE 



F_l 

FOLIAGE 



Fig. 4. Tree Test FIS Model 
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Psychodiagnostics analyze specific characteristics from the drawing. For Root 
we can select sketching type and size, the root represents the past and reflects per- 
son's dependency. For Trunk we can select form, area, height, sketch intensity and 
curvature, the trunk depicts the present and reflects person's affectivity. For Foli- 
age we can select form, size and extra features, it symbolizes achievements or 
goals reached. As a first model proposal the Tree Fuzzy Sets were defined as 
follows: 

The Set of Linguistic Variables for Root is: R(x) = {null, none, with}. When 
there is no sketch of any root Null is defined as the attribute, if the root is hidden 
the attribute is None, and any sketch of roots the attribute has been defined as 
With. 




Fig. 5. Root Recognition FIS 

The Set of Linguistic Variables for Trunk is: T(x)= { straight, wave, trapeze }. 
When the sketch of the trunk is two parallel lines the attribute is defined as 
Straight, if one or two of the trunk lines are curved the attribute is Wave, and two 
straight or curved lines with a wider bottom than the top the attribute is Trapeze. 




Fig. 6. Trunk Recognition FIS 



The Set of Linguistic Variables for Foliage is: F(x)= { circular, cloud, fruit, 
null } . Just a round sketch of foliage the attribute is defined as Circular, if it has 
wavy contour with or without faint sketches inside the attribute is Cloud, if it has 
any fruit the attribute is Fruit, and any sketch of only branches or leafs the 
attribute is Null. 
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Fig. 7. Foliage Recognition FIS 

The Set of Linguistic Variables for the output Role is: Q(x)= { Analyst, Archi- 
tect, Developer-Programmer, Documenter, Tester, Image and Presenter } . 

Labels were assigned to each attribute of later sets, and consecutive values 
starting on one were also assigned. Figure 8 illustrates attribute's membership 
functions of linguistic variables Root (R), Trunk (T), Foliage (F) and Role (Q), 
displaying intervals for each label. 




Fig. 8. Input and Output Membership Functions 
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With these intervals in combination with the Set of Rules obtained from 
conclusive results {3} we define RAMSET's Tree Test Fuzzy Model application. 

5 Results 

Since 2007-2 until 2009-1, 56 software engineers have worked in real projects, as- 
signing 88 roles in software development teams. Of these 15 have been assigned 
as Analysts, 15 Architects, 20 Developers/Programmers, 14 Documenters, 15 
Tester and validator, 9 Image and presentation. 

Table 2 shows Roles and MBTI types member relationship's of the teams. 
ENFJ, ENFP, INTP, INFJ, INFP types have not come up in the working samples. 
We can observe that personality type traits with higher percentage are (E) extro- 
verted 70%, (S) sensing 74%, (T) thinking 66% and (J) judging 81%, traits to be 
considered for role assignment without a question. Traits as (I) introverted, (N) 
intuitive, (F) feeling or (P) perceptive are of lower value, only in combination with 
other traits can be considered of importance for role assignment. 

Table 2. MBTI types and Roles Frequency 



IP 



J2 13 14 15 16 17 18 19 110 111 TOT 



Analyst 


7 


2 


4 


1 






















15 


Architect 


7 


4 





1 







1 


1 











15 


Developer 
Programmer 


2 


2 


1 


5 


3 







1 


2 


2 


1 


20 


Documenter 


5 


2 


1 


1 


1 




1 


1 


1 








14 


Tester 


5 


3 


4 





1 




1 














15 


Image and 
Presenter 


2 


3 


2 





1 





1 














9 


Totals 


28 


16 


12 


8 


6 


5 


4 


3 


3 


2 


1 


88 


*I1=ESTI, I2=ENTJ, I3=ESFI, I4=ISTP, I5=INTI, J6=ISTJ, I7=ISFI, I8=ESFP, 
I9=ISFP, I10=ESTP, I11=ENTP 



By implementing a fuzzy model the 16 jungian types are considered as antece- 
dents relating them with the consequent resulting role. Using weighted mean me- 
thod a specific weighted factor was assigned in order of importance, by this 
the data instead of contributing equally to the final average, some data points 
contribute more than others. 

Taking the Analyst role as an example factors Jl, 13, J2, J4, J6, J5, J7, J8, J9, 
J 10 and Jl 1 were put in decrementing order giving the first a value of 11 as seen 
in Table 3. The value (V) was multiplied by its frequency factor (F). This result 
was then divided by the sum total obtaining the final weight of each factor. 
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Table 3. Analyst Role weights 



(V)alue 


11 


10 


9 


8 


7 


6 


5 


4 


3 


2 


1 


TOT 


(F)requency 


7 


4 


2 


1 


1 




















15 


V*F 


77 


40 


18 


8 


7 




















150 


Final 
Weight 


.513 


.267 


.12 


.053 


.047 




















1 


Attribute 


Jl 


J3 


J2 


J4 


J6 


J5 


J7 


J8 


J9 


J10 


Jll 





Applying the same method to attributes' frequencies from table 1 viewing 
MBTI types and Software Engineering Roles performed Table 4 is obtained. 

We observe that ESTJ type is a highly qualified individual to perform different 
roles; our stats recommend the Analyst or Architect role. For an 1STP individual 
Developer or Programmer is best fitted, an ESTJ for Tests although others can 
perform this role. And for our presenter and image we recommend an ENTJ, it can 
also be ESTJ or ESFJ. Viewed by those not recommended, types ESFP, ISFP, 
ESTP, ENTP should not be assigned as analysts, architects or testers and ENFP, 
INTP, INFJ, INFP haven't been considered for any role. 

Table 4. Software Engineer Roles weights 





Jl 


J2 


J3 


J4 


J5 


J6 


J7 


J8 


J9 


J10 


Jll 


ANA 


.513 


.12 


.267 


.053 





.047 

















ARC 


.524 


.272 





.061 





.054 


.048 


.041 











DEV 


.113 


.101 


.031 


.346 


.189 


.025 





.019 


.088 


.075 


.013 


DOC 


.47 


.171 


.077 


.068 


.051 


.06 


.043 


.034 


.026 








TST 


.385 


.189 


.28 





.056 


.049 


.042 














PRS 


.233 


.384 


.209 





.093 





.081 















Analyzing by attributes the ESTJ is an analyst and architect being (E) extro- 
verted and with good (J) judgment to relate with other people and take important 
decisions. The developer or programmer is an ISTP commonly (1) introverted and 
(T) thoughtful to his work with logic for problem solving. 

We analyzed the Tree Test in the same periods of time, having 74 drawings of 
trees obtaining weights for the attributes sketched using same mean weights me- 
thod. Table 5, 6 and 7 are the universe of weights for Root, Trunk and Foliage 
respectively. 



Table 5. Root's weights 



R(x) 


ANA 


ARC 


DEV 


DOC 


TST 


PRS 


Rl 


0.103 


0.182 


0.441 


0.030 


0.067 


0.050 


R2 


0.276 


0.636 


0.441 


0.727 


0.333 


0.200 


R3 


0.621 


0.182 


0.118 


0.242 


0.600 


0.750 
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T(x) 


ANA 


ARC 


DEV 


DOC 


TST 


PRS 


Tl 


0.174 


0.174 


0.25 


0.091 


0.097 


0.316 


T2 


0.652 


0.174 


0.656 


0.455 


0.452 


0.632 


T3 


0.174 


0.652 


0.094 


0.455 


0.452 


0.053 






Table 7 


. Foliage' 


i weights 






F(x) 


ANA 


ARC 


DEV 


DOC 


TST 


PRS 


Fl 


0.225 


0.153 


0.340 


0.243 


0.243 


0.130 


F2 


0.6 


0.307 


0.545 


0.540 


0.162 


0.695 


F3 


0.15 


0.512 


0.090 


0.162 


0.540 


0.087 


F4 


0.025 


0.025 


0.022 


0.054 


0.054 


0.087 



From this data we can obtain the Set of Fuzzy Rules annotated below: 



Set of Rules (3) 






IF R is R3 AND T is T2 AND F is F2 


THEN 


QisQl 


IF R is R2 AND T is T3 AND F is F3 


THEN 


QisQ2 


IF R is Rl AND T is T2 AND F is F2 


THEN 


QisQ3 


IF R is R2 AND T is T2 AND F is F2 


THEN 


QisQ3 


IF R is R2 AND T is T2 AND F is F2 


THEN 


QisQ4 


IF R is R2 AND T is T3 AND F is F4 


THEN 


QisQ4 


IF R is R3 AND T is T2 AND F is F3 


THEN 


QisQ5 


IF R is R3 AND T is T3 AND F is F3 


THEN 


QisQ5 


IF R is R3 AND T is T2 AND F is F2 


THEN 


QisQ6 



These set of rules are implemented in MatLab's commercial Fuzzy Logic Tool- 
box [10] as seen in figure 9 to simulate our case studies and have a first approach 
to automate role assignment with RAMSET in software engineering projects. 
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Fig. 9. RAMSET' s Rules Edited in Matlab's Fuzzy Logic Toolbox 
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If we analyze a Root-Trunk relationship shown on figure 10, we can find 3 data 
zones, between 1 and 2.4 weigh for (R) and (T) we have an average 3 to 3.5 weigh 
thus recommending Developer-Programmer or Documenter. Above 2.4 for (R) 
and (T) it recommends a Tester. (T) above 2.4 and (T) below 2 it recommends an 
Architect. 




Fig. 10. Root - Trunk Relationship 

If we analyze the Root-Foliage relationship as in figure 11, we can find 3 data 
zones, between 1 and 2.5 weigh for (R) and (F) we have an average 3.5 weigh thus 
recommending Developer-Programmer or Documenter. Above 2.4 for (R) and (F) 
it recommends a Tester or Presenter. (T) below 1.5 and (F) below 3 it recommends 
a Developer-Programmer. 




R 

Fig. 11. Root-Foliage Relationship 
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Fig. 12. Trunk-Foliage Relationship 



If we analyze the Foliage-Trunk relationship as in figure 12, we can find 4 data 
zones, for any (F) and (T) below 2 we have an average 3.5 weigh thus recom- 
mending Developer-Programmer or Documenter. Above 2 for (T) and between 2 
and 3.5 for (F) it recommends an Architect or Programmer. Above 2 for (T) and 
above 3.5 for (F) it recommends a Tester or Programmer. 

From the sets of linguistic variables we can analyze each attribute highlighting 
for example, when Root (R) is null (Rl) the most probable role is Developer- 
Programmer (Q3). Without visible root (R2) we can assign Architect (Q2) or Do- 
cumenter (Q4). Any sketch of root (R3) we are talking about an Analyst (Ql) or 
Tester (Q5), even Image and Presenter (Q6). The Image and Presenter (Q6) role 
consists in selling, distribution and image design. The individual's quality per- 
forming this role has been related with his own personal image, and a high per- 
centage present the attribute (R3), drawing roots even highlighting thick roots, as 
we analyze this individual we can see he wants to draw more attention, wants to 
be noticed and depends of what other people say. 

Analyzing the Trunk (T) there are less differences between roles, wavy trunks 
(T2) are Analysts (Ql), Developer-Programmers (Q3), Testers (Q5) or Presenters 
(Q6). What it is sure in this attribute, we can distinguish an Architect (Q2) from 
the others because he draws the trunk in a trapeze shape (T3). The Foliage (F) dis- 
tinguishes an Architect (Q2) and a Tester (Q5) from other roles as they draw trees 
with Fruits (F3), others draw cloudy (F2) type most of the times. 

With the set of rules defined { 3 } we can distinguish two roles from others. Ar- 
chitect (Q2) has the only combination of without root (R2), trapeze (T3) and fruits 
(F3); and the Tester (Q5) is the only one with root (R3), trapeze or wavy (T3 o T2) 
and fruits (F3). Drawing fruits means this individual has a clear view of what he 
wants to do, he has achieved personal goals in life, giving him serenity to take 
charge of any project and achieve goals set and obtain the final product, qualities 
of a leader and architect. 
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There's a similarity between developer-programmer (Q3) and documenter (Q4) 
and between analyst (Ql) and presenter (Q6). Some cases are differentiable be- 
tween programmer with {Rl, T2, F2} and the documenter with {R2, T3, F4}, al- 
though combination { R2, T2, F2 } pops up more frequently. Also the combination 
{R3, T2, F2} does not distinguish between analyst and presenter, this results give 
us no significant difference in these cases, thus applying and increasing more 
cases will give us a more significant result. 

The results obtained for the FIS rules are from 2007-2 thru 2009-1, in the se- 
mester 2009-2 we applied the methodology in a group of 16 engineers, after the 
sociogram and personality tests; we used the tree test results and simulated them 
in the Fuzzy Logic Toolbox of Matlab [10] to prove RAMSET in assigning roles 
with our FIS model (2009-2). If we work only with Analyst, Architect and Devel- 
oper-Programmer roles, our fuzzy model can help us 100 percent in distinguishing 
each role. For larger teams that perform with more roles it helps us but we cannot 
base role assignment in only the Tree Test, which is why we are proposing the use 
of other personality tests to complement each other for best role assignment of 
team members. 

The Tree Test is only one test of many personality tests applied and proposed, 
later analysis is a first instance of observations of tree drawings, a more detailed 
analysis and collaborative judgment for interpretation and classification of 
sketches will confirm selection of them and we can propose a broader range of 
attributes to help us distinguish roles better. 
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Fig. 13. Tree Test FIS Model Architecture 



6 Conclusions 



Through the years in Software Engineering Courses, usual integration of work 
teams for software development projects has form all kind of teams, conflictive 
teams, good working teams, mediocre teams, excellent teams. Teacher's intuitive 
experience is the main tool for integrating the best teams. Using the sociogram 
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technique and personality tests has helped us in forming working teams less 
conflictive, those which by having more affinity between their members they form 
natural groups that work more fluidly and perform better. 

The objective of using RAMSET is identifying the individual's qualities to per- 
form the most suitable role in a working team. Some personalities and typologies 
have been identified to perform a type of role; we need more evidence in 
other type of teams to prove our results applied in software engineering courses 
established until now. 

The FIS model of personality Tree Test proposes linguistic variables, weights 
and membership functions applied and proven in our Software Engineering 
Courses, continuity of this work can improve the Data Base Rules arisen from this 
study and implement it as a computer aided software tool making RAMSET an 
automated tool for role assignment. 

Future work is designing and implementing direct image recognition of Trees 
drawn by individuals, and applying uncertainty in sketch analysis to determine in- 
put linguistic variables making RAMSET's automated tool more efficient and 
effective. 

We come to the conclusion that integrating work teams in Software Engineer- 
ing Courses is a highly potential area of exploration, whereas the influence of 
personality in team role assignment is a task to be clearly defined. We know that 
personality is an important factor to performance of the team, thus is latent the dif- 
ficulty to assign the adequate role to each member so the team can perform with 
success. Automation of RAMSET will help us with this task. Follow up of RAM- 
SET not only in Software Engineering Courses but in other courses of the same 
program can give us a comparative study about role assignment in work teams 
based in personality, confirming RAMSET as a methodology for integrating teams 
and an instrument to select personnel for Software Engineering Teams. 
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Abstract. The Educational timetabling problem is a common and hard problem 
inside every educative institution, this problem tries to coordinate Students, 
Teachers, Classrooms and Timeslots under certain constrains that dependent in 
many cases the policies of each educational institution. The idea behind 
hyper-heuristics is to discover some combination of straightforward heuristics to 
solve a wide range of problems. This paper presents a GA-based method that pro- 
duces general hyper-heuristics for the educational timetabling design problem us- 
ing API-Carpio methodology. The GA uses static-length representation; witch 
involves the complete encoding of a solution algorithm capable to solve schedule 
design instances, this hyper-heuristic is achieved by learning and testing phases 
using real instances from Intituto Tecnologico de Leon producing encouraging re- 
sults for most of the instances. Finally we analyze the quality of our 
hyper-heuristic in the context of real Academic timetabling process. 

Keywords: Genetic Algorithm, Timetabling, Educational Timetabling, Heuristics, 
Meta-heuristics, Hyper-heuristics. 

1 Introduction 

The timetabling problem is one of the most difficult and common problems inside 
a company, where it must done some activities such time assignment to workers 
and machinery, this activities are examples of timetabling. 

A correct timetabling is fundamental for a good performance of every compa- 
ny, to assign and utilize efficiently the resources that owned. The same problem 
can be seen into an academic environment where classrooms, teachers and 
students have a important role in the schedule design process. 

An automatic educational timetabling generation provides great benefits not 
only decreasing the time required to produce it and by, optimizing human and 
physical resources, but also by improving the precision and quality of schedules 
eliminating the human factor, in this way, a lot of many errors and conflicts like 
two or more classes assigned at the same time to a student can be avoided. 
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The proposal is the creation of a computational module which allows permits 
academic timetabling design process automatically, saving time and resources for 
the academic institution where it applies. 

The aim of this paper is to explore a novel alternative on the usage of evolutio- 
nary several approaches to generate hyper-heuristics for the schedule design 
process. A hyper-heuristic is used to define a high-level heuristic that controls 
low-level heuristics [3], The hyper-heuristic should decide when and where to ap- 
ply each single low-level heuristic, depending on the given problem state. The 
choice of low-level heuristic may depend on the features of the problem state, 
such number of conflicts, quantity of classrooms, numbers of events at the day, 
etc. 

The investigation in this article, presents a method to generate a general hyper- 
heuristic intended to provide a way to design schedules into an academic institu- 
tion, The procedure learns the hyper-heuristic by going through a training phase 
using several real instances of academic schedules of the Instituto Tecnologico de 
Leon, Mexico, The general method is based on a static-length genetic algorithm, 
where the chromosome is conformed of a blocks of heuristics, these blocks are 
directed by an auxiliary structure called Vector Direction Column (CDV) that con- 
tains the semester where the heuristics may be applied, also a novel component the 
Supervisor defined as a guiding component that ensures the applicability of the 
hyper-heuristic in real world problems. 

The paper is organized as follows. Section 2 presents the Timetablig design 
problem, the solution method proposed and its justification. Section 3 contains the 
experimental setup, the results, their analysis and discussion. Finally Section 4 we 
include our conclusions and some ideas for future work. 



2 Solution Approach 

In the literature, one can see that Evolutionary Computation has been used in 
scheduling and timetabling problems [2], [14], [15], [21]. Recently a novel metho- 
dology of schedule design has emerged [1], [4], this API-Carpio methodology has 
shown based on experience several great results in the educational timetabling de- 
sign for regular and non-regular students, this methodology is capable to find good 
solutions minimizing the conflict for regular students defined as students who 
only have in their schedules subjects of the same semester or period, and 
non-regular students defined as students who have subjects from different seme- 
sters or periods in their schedules. [1], [4] unfortunately, there still no exist an au- 
tomated computational module that can perform this method. Also Terashima et.al 
[26]. used a combination between low-level heuristics and a Genetic Algorithm 
for Constraint satisfaction strategies in examination timetabling, but did not 
incorporate the concept of hyper-heuristic. 

Evolutionary Computation usually includes several types of evolutionary algo- 
rithms [29] one common example is Genetic Algorithms [8],[9],[10],[11], In 
this research we use a Hyper-heuristic approach to resolve the schedule design 
problem using the API-carpio methodology. 
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2.1 The Academic Timetabling Design Problem through API-Carpio 
Metodology 

The solution of academic timetabling (TTP) produces a table of times [25], in 
general, the TTP problem can be seen as a set of events e, a set of time spaces 
(slots) s and a set of constrains c between events and slots. Every timetabling 
process tries to assign every event e into a slot s without violating the constrains 
c.[25]. 

So basically our problem consist into achieved a series of elements e assigned 
into several slots s, how allow us to cover several academic necessities. Each 
element e have some properties: 

A) One student group is assigned to an event e. 

B) One teacher is assigned to an event e. 

C) One subject is assigned to an event e. 

D) Each event e is assigned to a Slot s. 

E) One classroom is assigned to an event e. 

Commonly usually we have more Events e than timeslots s, so our timetabling 
process needs to assign several Events e to a single timeslot, then our Timetabling 
primary objective is assign these events to timeslot tending to several constrains 
like: 

A) Normally the same assignment of events will be repeated each week. 

B) There may be special schedules per Teacher. 

C) We need to offer special subjects per semester. 

D) There are limited number of classrooms and laboratories. 

E) Each teacher can only teach some types of subjects. 

F) Each group has a different schedule. 

G) There are a minimum and maximum number of hours that a teacher need to 
achieved in a week. 

The Api-Carpio methodology[5] presents a mathematical description of this 
behavior, we have: 

fix) = FAix) + FPix) + FI(x) (1) 

Where: 

FAix) =number of students conflicts inside the timetablig x. 
FPix) =number of teacher conflicts inside the timetablig x. 
Flix) =number of classrooms conflicts inside the timetablig x. 



With: 



FAix) = ZtiFA s . (2) 

FPOO = J$ =1 FP St (3) 

FKx)=HZ =1 FIs i (4) 
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Where: 

Fi4 s .=Number of student conflicts into Slot i of our current timetabling. 
FP s .=Number of teacher conflicts into Slot i of our current timetabling. 
F/ s .=Number of classroom conflicts into Slot i of our current timetabling. 
K = Number of timeslots available for our timetabling. 

Then: 

FA Si = Z% l^Ji 9 ( A Ua A \ g+ i) (5) 

FPs^^^'iPugAPtj+i) (6) 

FIs^^X^O^Mi^i) (7) 

Where: 

A ig A A ig+1 =number of students who demand the concurrent enrollment of 
subjects (M ig and M ig+1 ). 

Pi,g A Pi,g+i =nu mber of teachers who demand the concurrent teach of subjects 
(M ijQ and M Ug+1 ). 

h,g A h,g+i =number of classrooms who serves concurrent the subjects 
(M ig and M ig+1 ). 

M s . =Number of subjects assigned to our Slot i, inside of our timetabling. 

M ig = Subjects assigned to our slot i, inside of our timetabling. 

As it can be seen, the educational timetabling is not an easy task, so in this paper 
we focused into the first part of the methodology, the design of a timetabling only 
considering the first variable of APl-Capio strategy, Students. 

Now our problem is to choose a way of combining subjects of an academic ca- 
reer into several timeslots in order to have the less possible student conflicts, a 
student conflict is defined as a student who have two or more subjects of the same 
semester into the same timeslot.[5] 

2.2 API-Carpio Educational Timetabling Student Model 

This model considers timeslots as vectors monitoring students, teachers, 
classrooms and laboratories simultaneously. This method allows the solution con- 
struction through simple heuristics handled by the genetic algorithm, as stated 
previously we only perform the first step into the API-Carpio methodology, the 
minimization of Student conflicts into a timetabling. Because we don't perform a 
complete timetabling we consider our work as a design of a timetabling with a 
great characteristic, this timetabling will be offer the less student conflict so it is a 
excellent start point to the timetabling complete process. [5] 
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Table 1 shows the general structure of a academic career from the point of view 
API-Carpio methodology. 



Table 1. General structure of an academic career 
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Where: 

n = number of semesters or periods of the career. 

I 
1 = number of different subjects for semester i , i=l,2,3,..,,n 

L= total of subjects of a specific career. 

L = XUtt = -ei+ *2 + - + fn 

Mjj= Subject from a i semester and aj position inside the same semester 



(8) 



This vectorization is performed by assigning subjects to different vectors (timeslots). 
a hard constrain in API-Carpio methodology says : Never assign two or more 
subjects from a same semester to a same vector.[5] This rule ensures that there are 
no subjects of the same semester at the same timeslot, so a regular student, student 
who have in her schedule only subjects of the same semester or period, never will 
have a two o more subjects assigned at the same time. These vectors have a number 
which are directly related to the number of timeslots offered by the academic 
institution. An example of this timeslots can be seen in Table 2. 



Table 2. Timeslots ID offered by Institute) Tecnologico de Leon 



Hour 


Monday & 
Wednesday 


Tuesday & 
Thursday 


7:00-08:40 


1 


2 


8:45-10:25 


3 


4 


10:30-12:10 


5 


6 


12:15-13:55 


7 


8 
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API-Carpio Hard constrain can be achieved in the following way: in the first 
step all the subjects are arranged by semester or period as show in Table 1, then to 
each subject from first semester is assigned randomly different vector number be- 
tween 1 and k, where k is determined by the maximum number of timeslot offered 
by the institution, for our example we use number from 1 to 8, after we proceed 
similarly with subjects of second, third and N-semester. 

Once assigned this number we perform an ordering of all the subjects by this 
vector number, leaving a partition of the career in independent vectors, where each 
vector contains subjects of different semesters or periods. An example of how is 
constructed this Vector shown in Table 3. 

Table 3. Vector Constructions 



No 


SUBJECT 


SEMESTER 


VECTOR 


1 


ACM0403 




1 


2 


SCM0414 




6 


3 


SCB0421 




7 


4 


SCE0418 




5 


5 


ACH0408 




4 


6 


ACM0401 




2 


7 


TUT0001 




3 



Now we can read our schedule design on the following way, if one subject of 
the first semester or period was assigned the number 5 as vector number, this sub- 
ject will be offered by the institution at 10:30-12:10 on Monday and Wednesday, 
all the subjects have now assigned a specific timeslot or vector. 



2.3 Hyper-Heuristic Approach 

As noted, until now we have seen how to model the schedule design by API- 
carpio methodology, this methodology can be used by an Hyper-heuristic ap- 
proach, first is necessary to identify the hyper-heuristic concept enunciated by 
Burke[3], which defines the hyper-heuristic approach as an algorithm that manag- 
es the choice of low-level heuristics that must be applied to our problem state, de- 
pending on our current space region, This implies that the algorithm must choose 
which heuristic ingredient will selected to modify the actual state of our problem. 
A scheme of this approach can be seen on Figure 1 . 
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High level Heuristic 



T 



1 



Selection of low level heuristic 



Heuristic Ingredients 



Target function 



Fig. 1. Common Hyper-heuristic Approach 

2.4 The Set of Heuristic Used 

As show on the figure 1 is essential in first instance have a set of low-level heuris- 
tics, as well a high-level heuristic that choose the low-level ingredients to change 
the current state of the problem, in our approach we use as high-level heuristic the 
Genetic Algorithm (GA) because the adaptation of this technique to combinatorial 
problems, then we need to define a set of low-level heuristics that can been chosen 
by our GA, this set was constructed by the experience and the experimentation of 
the authors, we define our low-level heuristics as follows. 

Hl:Selection of subjects from semesters or periods separated, this heuristic 
receives a subject and returns another subject from a semester or period with a dis- 
tance of +2 o -2, for example, this heuristic receives an subject for 6th semester 
then, the heuristic could chose a subject from 8th semester or 4th semester to 
return, this check obey the number of subject currently assigned to the schedule. 

H2: Selection of a subject that minimizes the sum of current conflicts into the 
vector, this heuristic works over a vector, looking what subject minimizes the 
number on conflicts in the current vector, once selected makes the change 
assigned the subject to the vector. 

H3: Selection of a subject that minimizes the product of current conflicts into a 
vector, this heuristic works over a vector, looking for a subject that minimizes the 
product of the current conflicts (commonly a zero conflict subject) then perform 
the assignation of this subject to the vector. 

H4: Selection of a subject with the greatest conflict, this heuristic receives a 
subject then performs a check looking for the worse conflict, finally this worse 
case subject is assigned to another vector. 

This four heuristic are encoded as shows in table 4 for its use by the GA. 

Table 4. Codification of each heuristic into the chromosome 



H 


HI 


H2 


H3 


H4 


codification 


1 


2 


3 


4 
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2.5 Combining Heuristics with the Proposed GA Using API-Carpio 
Methodology 

The concept of hyper-heuristics is motivated by the objective to provide more 
general procedure for optimization [3] Meta-heuristics , methods usually solve 
problems by operating directly on the problem [11] [24], Hyper-heuristics deal 
with the process to choose the right heuristic for solving the problem at hand [29], 
[3] The aim is to discover a combination of simple heuristics that can perform well 
on a whole range of problems of the same kind [29] For real applications, exhaus- 
tive methods are not a practical approach [3] The search space might be too large, 
or number and types of constraints may generate a complex space of feasible 
solutions [9]. 

It is common to sacrifice quality solutions by using quick and simple heuristics 
to solve problems. Many heuristics have been developed for specific problems. 
But, is there a single heuristic for a problem that solves all instances well? , the 
answer is no [29], [3]. Certain problems may contain features that would make a 
specific heuristic work well, but those features may not be suitable for another 
heuristics. The idea with hyper-heuristics is to combine heuristics in such a way 
that a heuristic's strengths make up for the drawbacks of another [3]. 

The solution model used in this investigation carries features from previous me- 
thodology by Aguayo-Rios [1] and Carpio-Valadez [4] in witch the main objective 
is perform an optimal academic schedule design and work by Terashima 
et. al [29]. 

The basic concept is that, given a problem state P, we perform an algorithmic 
step to change the state from P to P'. The purpose is to solve a problem by con- 
structing the answer, deciding on the heuristic to apply at each step, then finally de- 
termine the quality of the algorithm of solution, that is to say our hyper-heuristic. 

A chromosome in our GA represents the sequence of algorithmic steps neces- 
sary to constructs a solution, this chromosome is helped by an auxiliary structure, 
called CDV. This CDV defines the direction of the heuristic that must be applied 
to construct a valid solution, moreover another software component called Super- 
visor this supervisor can change arbitrarily the content of the chromosome or its 
CDV in order to hold the solution into the real world constraint. The GA is search- 
ing for the best chromosome and CDV (representing the hyper-heuristic) which 
contains the best algorithm that resolves the Schedule design problem using the 
Api-Carpio Methodology. 

Our instances are divided into two groups training and testing sets, this in- 
stances are a real historical ones from the Intituto Tecnologico de Leon from 2003 
to 2006. The two groups have instances from a same career, the first group had 4 
instances each one between 400- 450 students and 52 subjects, the test group con- 
sist into 2 instances from the same career with 430 and 440 students respectively. 
The general procedure consist in solve simultaneously all training instances with 
Each CDV and Singles Heuristics, then the GA must search for chromosomes 
(hyper-heuristics) that brings better results in term of conflicts than the single 
Heuristics. Finally we use our best found hyper-heuristic to solve a test instance an 
see how good solves the educational timetabling design problem, it is important to 
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say that our match point will be the expert because there is no another 
computational module that uses API-Carpio Methodology to solve timetabling 
design problem. 

Representation. Each chromosome is composed of a series of blocks, the number 
of this series of blocks is L - v, where L is the total of subjects on a specific career 
and v is the number of vectors or timeslots offered by the institution, our problem 
have 52 subjects and 8 timeslots so our chromosome will have 44 blocks, this sub- 
traction is performed because all our heuristic need to work with one subject to 
make a comparation, so in the beginning we need to initialize each vector or time- 
slot with one subject, this initialization is done by separating the worse conflicts. 

Each block have a number between 1 and 4 this number is the code of a heuris- 
tic as seen on Table 4 additionally for each chromosome we have another number 
array with the same length, this array is called CDV that constrains the number of 
vector or timeslot which will be applied the heuristic coded into the chromosome, 
a proper example of this representation is shown in Table 5. 

Table 5. Representation of chromosome with CDV 



Position 1 2 ... l-v 
Heuristic 3 2 ... 4 
CDV 8 5 ... 3 



With the addition of CDV we can see our chromosome like an algorithm that can 
be solve the academic timetabling design problem using the API-Carpio methodolo- 
gy, this algorithm can perform it if we move upward the chromosome and apply the 
step coded, following our example in the table 2, in the first step we apply the heu- 
ristic 3 to the vector or timeslot 8, Each heuristic assigns one subject to a specific 
timeslot, in the next step we apply heuristic 2 to timeslot 5 and so on until reach the 
L-v position, finally we will have all of our subject assigned into our timeslots. 

The Fitness Function. The most common criterion to measure the efficiency of a 
search algorithm used to determine the quality of a timetabling design using API- 
Carpio methodology is a conflict check this verification counts the number of con- 
flicts into timetabling, A conflict is defined by the API-Carpio methodology as a 
student who have two or more subjects assigned at the same time, despite the fact 
that this methodology guarantees zero conflicts with regular students, non- 
regular students could have conflicts, so our high-level heuristic need to search 
the lowest conflict algorithm. To calculate the relative quality of an individual 
there are 3 steps that must be done: 

A)Every timetabling design instance is solved using 4 predefined chromosomes 
each chromosome have only one type of heuristic, for example the first predefined 
chromosome only have in each block the value " 1 " and so on, and the CDV of our 
current individual to measure. 

B)This results are saved into a matrix M, this matrix have a size of pxq, where 
p is the number of our test instances and q is the number of our predefined 
chromosomes , in our case is 4. 
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C) Now we proceed to solve each instance with our current chromosome and 
CDV, this measure is saved into a numeric array C of the same size of p. The 
fitness function FFi is computed as follows . 



FF i = 'Z P l=1 'Z q J=1 {C i -M i j) 



(9) 



Where: 

p = Number of our test instances. 

q= Number of our predefined chromosomes. 

Cj= Matrix with the values of our conflicts checks for each our 
chromosomes applied to test instances. 

M t j = Matrix with the values of our conflicts checks for each our 
predefined chromosomes applied to test instances. 



This evaluation guarantees that, when the general hyper-heuristic produces better 
results, the fitness function will return lower values so essentially we search for 
the minimum possible value. 

3 Experiments and Results 

This section presents the experiments carried out during the investigation and the 
results obtained. Our training set is composed for several real instances from the ca- 
reer of Engineering in computer system at Instituto Tecnologico de Leon under the 
APl-Carpio methodology from 2003 to 2008, this instances are composed by a ma- 
trix called MMA, this matrix have in its rows and columns subjects from a specific 
career, its cells contains the number of conflicts that the assignation of both sub- 
jects into a same timeslot could do. an example of this matrix is presented in Fig 2. 
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Fig. 2. Example of MMA Matrix 
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The training set was used to generate trough our high-level heuristic the best 
hyper-heuristic that can solve the training set simultaneously, then we use that 
hyper heuristic to solve another different instance from 2009. 

3.1 Experiment Type 1 

We perform several test using different initial populations sizes with the aim to 
taking the time that our AG uses to run 100 generations, The main idea is to ana- 
lyze the resolution time thought hyper-heuristics, our results are shown in Table 6. 

Table 6. Time needed to achieved 100 generations 



Population 


2 


4 


g 


16 


32 


64 


128 


256 


Time (s) 


3.16 


53.15 


96.10 


170.50 


257.20 


539.15 


640.3 


1035.41 



In general the algorithm perform a reasonable resolution time, considering that 
resolves 4 instances simultaneously. 

3.2 Experiment Type 2 

In this experiment we search for the best hyper-heuristic using the table 6 with 
changes in some AG features like elitism, selection method, initial population, 
percentage of mutation and crossover, the best hyper-heuristic was found with 
the features presented in Table 7 and Fig 3. The idea is to obtain a good 
hyper-heuristic whit a good time and fitness. 

Table 7. Best Fitness Achieved 



Feature 


Value 


Initial Population 


20 


Selection Method 


Roulette 


Elitism 


20% 


Crossover 


95% 


Mutation 


1% 


Iterations 


100 


Fitness Achieved 


-4750 


Time (s) 


188.56 
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Fig. 3. Performance of our High-level GA 
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A closer analysis of the components of this hyper-heuristic can be done if we 
consider their heuristic occurrence frequency as seen on figure 4. 



Heuristic frequency 





■ frequency 



heuristic 1 heuristic 2 heuristic 3 heuristic 4 



Fig. 4. Heuristic frequency inside our hyper-heuristic 

The most important heuristic in this result is heuristic 2, this heuristic search 
for the minimum sum conflict inside a vector or slot, in the another hand the less 
important heuristic is number 3, heuristic 1 and 2 have a similar frequency, 
this behavior show us the importance of each heuristic step in the generation of 
educational timetabling using APl-Carpio methodology. 

3.3 Analysis of Our Best Achieved Hyper-Heuristic 

After we have our best general hyper-heuristic, we use it with a new instance of 
schedule design problem, our final conflicts number was 162, this improves the 
quality of solution against the human expert who takes 1 week in order to achieve 
a schedule design solution without a proper conflict check . this 162 conflicts 
means 162 non-regular students in the whole career who could have a conflict in 
theirs schedule, in a deeper study this career have 9 semester so it is possible to 
find the exact number of conflicts per semester, this number is 162/9 =18 and 
each semester have near 6 subjects so the number of conflicts per subject is 
18/6=3, finally we have 3 students affected per subject but this conflict could be 
easily corrected if we consider 2 or 3 classes per subjects. Also we have a final 
hyper-heuristic that is applicable to any instance of computer systems engineering 
under API-Carpio Methodology without the use of the high-level GA again. 

4 Conclusions and Future Work 



This documents has described experimental results in schedule design problem 
under APl-Carpio methodology to a real academic institution by an 
Hyper-heuristic approach which involves several low-level heuristic defined by 
experience and experimentation. Overall the scheme identifies efficiently general 
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hyper-heuristics after going through a learning procedure with training and testing 
phases. When applied to unseen examples those hyper-heuristics solve many of 
the problems efficiently, in some cases better that best single heuristic for each in- 
stance. Ideas for future work involves the search and development of new low- 
level heuristics, as also, the application of new techniques inside the high-level 
GA to perform a better search for the hyper-heuristics. 
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Abstract. Today the issue of logistics is a very important within companies to the 
extent that some have departments devoted exclusively to it. This has evolved over 
time and today is a fundamental aspect in the fight business seeking to consolidate 
or remain leaders in their field. With the above we know that logistics can be di- 
vided into different classes, however, in this regard, our study is based on the timely 
distribution to the customer with a lower cost, higher sales and better utilization of 
space resulting in excellent service. Finally, prepare a comparative analysis of the 
results with respect to another method of optimization solution space. 

Keywords: Logistics, Data Mining, Cultural Algorithms, Population Space, Space 
of Beliefs, Protocol, Bin Packing, Simplex Method. 

1 Introduction 

Within the area of distribution of purified water, there isn't methodology to what 
the Service Logistics and space optimization in delivery vehicles, so that the ser- 
vice within the "La Noria" become at the logic or the need. But not have defined a 
pattern of optimal service logistics. 

The optimization problems have been attacked widely in the area of evolution- 
ary computation; this has been due largely to the kindness they have shown to 
solve such problems [2] to name a few. 

This paper addresses the solution of Logistics Service Based on Data Mining in 
combination with other techniques such as evolutionary algorithms, as is cultural 
algorithm, once the tool is implemented together with the Bin Packing is to opti- 
mize for the space within distribution vehicles purified water. 

At present there are many optimization techniques, such is the case of the Sim- 
plex Method, and Simplex Method is an iterative process that progressively allows 
an optimal solution to linear programming problems [15] to name a few, the main 
feature of this method is that it attacks the problem by restricting its maximum ca- 
pacity through the vertices of the same [10] to name a few. 
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2 Methodology 



Data mining is a process that uses several data analysis tools to discover patterns 
and relationships in data that can be used to make valid predictions [16] to name a 
few. The foundations of data mining is in the artificial intelligence and 
the statistical and using the models extracted using data mining techniques which 
addresses the solution to problems of prediction, classification and segmentation 
[14] to name a few. 

The process for conducting the data mining are: 

Selecting the dataset. 

Analysis of the properties of the data. 

Transforming the input data set. 

Select and apply the technique of data mining. 

Evaluate the results. (See Figure 1). 




Fig. 1. Diagram of Data Mining 



Data mining aims to generate information similar to that which could generate 
a human analyst: patterns, associations, changes, anomalies and significant 
structures. 

The Cultural algorithms (CA's) are a class of computational models derived 
from observing the process of cultural evolution in nature as proposed 
by Reynolds in [11] to name a few. They consist of a population and a belief space 
as shown in Figure (2). The selected individuals from the population space con- 
tribute to cultural knowledge through the role of acceptance [8] to name a 
few. Cultural knowledge resides in the space of beliefs where it is stored and up- 
dated based on individual experiences of either success or failure. The knowledge 
in the belief space can also be used to influence their individual memories [9] to 
name a few. The (AC's) in addition to space and space population of beliefs, have 
a third component of importance: communication protocol, describes how knowl- 
edge is exchanged between the first two components [13] to name a few. The 
population space can support any population based on a computational model, 
such as Genetic Algorithms and Evolutionary Programming (See Figure 2). 

Cultural Algorithms are a dual system of inheritance that characterizes the evo- 
lution of human culture in the macro-evolutionary level, which occurs within the 
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space of beliefs, and micro-evolutionary level, which occurs in the area of popula- 
tion [4] to name a few. The knowledge produced in the population that the space 
in the micro-evolutionary level is accepted or to be passed to the belief space and 
used selectively to adjust the knowledge structures there [12] to name a few. This 
knowledge can then be used to influence the changes made by the population in 
the next age. 

^ update() 



Belief Space 



accept() influence() 

f\L . ±A 

selectf) q^^ Population Space ^^) obj() 

lAPf ^C\J 

generatei) 

Fig. 2. Diagram Concept of Cultural Algorithms 

Cultural algorithms using five kinds of basic knowledge to generate an ade- 
quate solution in the search space where the problem is solved. The sources of ex- 
pertise include regulatory knowledge (ranges of acceptable behavior), situational 
awareness (or copies of reports of successful or unsuccessful solutions, etc.), do- 
main knowledge (knowledge of objects in the domain of relations they and their 
interactions), historical knowledge (temporal patterns of behavior), and topog- 
raphical knowledge (spatial patterns of behavior) [5] to name a few. To make your 
programming is relatively simple, as shown in the pseudocode in (See Fig 3). 

Begin 

f = 0; 

initialize B', P' 
repeat 

evaluate P' {obj()} 
update (B\ accept(P')) 
generate (P 1 , influence{Ehj) 
t=t+ 1; 

select P r from P'~ ' 
until (termination condition achieved) 
End 

Fig. 3. Cultural Algorithm Pseudocode 

The main difference between Cultural Algorithms and Evolutionary Computa- 
tion other techniques lies in the belief space utilization as well as the cultural in- 
fluence of the same, as are those that guide us in obtaining optimum best [3] to 
name a few. 
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3 Tools Developed 

The prototype (see Figure 4) is an intelligent hybrid system developed with the 
Java programming language that combines data mining techniques to cultural al- 
gorithms. He got a map of Fresnillo and divided into 4 zones. To get the coordi- 
nates of each colony, then we started building the data warehouse, organized in 
the following fields: area, district, year, month, day, time and sales. 

It was necessary the creation and implementation of algorithms capable of find- 
ing information in n dimensions, as well as a data clustering algorithm called K- 
Means [1] to name a few, which generates data pooling, without predefined 
classes, based on a function of similarity of the values that have different attrib- 
utes, done in unsupervised [7] to name a few (i.e., discover patterns or trends in 
the data). K-Means is a partitional clustering method (i.e. we start altogether the 
particular), where partitioning is performed a database of n objects in a set of k 
groups, seeking to optimize the chosen partitioning criterion. In K-Means each 
cluster of data is represented by a centroid. K-Means is trying to form k groups 
with k predetermined before the start of the process. The goal is to minimize the 
within-group variance [6] to name a few. 

With the use of these tools Data Mining, the prototype is able to determine 
(given a zone and a particular date) in which colonies are more likely to occur at a 
given time sale. Once these colonies is generated a population of n agents (an 
agent is the computer simulation of a person), which form a society based on cul- 
tural algorithms, which are responsible for determining, over the ages, the course 
optimal performance. 

At the time zero (when initializing the program, and the agents beliefs have an 
empty space), all agents obtain the information generated from each colony, each 
propose a route, it will lead to negotiations between the agents to select the best 
route proposed at a certain time, the belief space will be updated only when the 
proposed route is better than the previously stored in the belief space, beginning a 
cycle of improvements that will be interrupted when they occur many 
times ^(iterations in the behavior of agents) without improvements to the paths or 
when a stop condition is performed. 

Once the data mining software, proceeded to the development of software for 
Bin Packing, the prototype is a hybrid intelligent system developed in Java, using 
the technique of Cultural Algorithms. 

It began with the taking of measures of distribution vehicle on which the study 
was conducted to determine the space with which states in m 3 and measuring the 
various presentations and their respective volume, for in doing so raise the prob- 
lem and their respective restrictions. 

Also token into account the demand of different presentations, to thus more ac- 
curately determine a product's usefulness. 
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Table 1. Product Description 



PRODUCT 


CAPACITY 


% IVOLUME 
DEMAND 


UTILITY 


1 


20 It 


45 


3<5500cm 3 


t 10.00 


2 


15 !t 


15 


2(573 1cm 3 


12150 


3 


lit 


15 


18435cm 3 


$21.60 


4 


500 | 


25 


18177cm 3 


$ 38.50 



Once the measurements were made, it was necessary to create an algorithm 
capable of finding the right mix in terms of cargo is concerned, so as to optimize 
the gain of the pickup and hence of the company, the algorithm uses a popula- 
tion basis, and initializes the other as an area of belief at that time its value is 
unknown. 

It makes adjusting the demands for this way fine-tune the value, which based 
on the percentage of sales of products. Initial population is evaluated based on the 
problem and the same restriction as shown in the table 1 . 

Max z= r i m i+ r a ni] J^ + r n nin 

Subject to V|iii| + 7]iiii+...t v n m n <=V (1) 

mi.m 2 .. m» = Integer 

V= 11 38425cm 3 



Where: 
r: 

v: 

m: 

V: 



Profit per unit. 

Volume of each unit. 

Are the units of each product type. 

Maximum volume capacity. 



Any condition which violates the restrictions will be penalized so that only the 
best combinations are obtained and thus we get an average by which we can have 
the best individuals and thus influence the next generation based on the mean 
individuals. 

The result (Epochs) is the proposed solution will stop condition from which is 
repeated 7 times without change, i.e., be = > to previous. 

4 Results 



The prototype used a database of sales generated at random, with it launched the 
system functions: information classification, clusters, the generation of routes (see 
Figure 4). 
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Fig. 4. Generating optimal routes 

The system will determine the colonies in which sales have been registered 
within the specified date range. Based on the number of colonies and vehicle using 
the K-Means algorithm clusters to be generated immediately after being delivered to 
a society formed by artificial intelligent agents (representatives of a group of indi- 
viduals), which will determine the most optimal route for each cluster (see Fig 5). 




Fig. 5. Graphic Convergence Times represented in epoques 



Moreover, the software for the Bin Packing use an initial population of 100 in- 
dividuals and an area of beliefs with the same number of individuals, and that they 
could launch the system where the initial population is initialized, the Space of 
Beliefs, evaluated the results thus able to apply variation operators under the in- 
fluence of belief space and get different times to achieve the status of unemploy- 
ment or the convergence of results. 

The program determines the best combination of load so as to maximize his 
profit, that based on the volumes handled for each presentation and the capacity of 
trucks in m 3 and the demand for each presentation (see figure 6). 
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Fig. 6. Generation of Bin Packing 

At the time 33 was found the best result after 8 times without change, that the 
following responses: 

Table 2. Results 



PRESENTATION 201t 1.51t 


lit 


500 It 


QUANTITY 16 6 
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NET PROFIT $853/30 



However, making a comparison with what is the Simplex Method, we can see 
that this method gives us less effective results, as we see in Table 3. 

Table 3. Result of the Simplex Method 
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As we can see from the table, the maximum utility proposed by the Simplex 
Method is $ 615.8346, i.e. less than the $ 237.76 proposed by our program. 

5 Conclusions 

This research is being used within the purified "La Noria" trying to demonstrate 
that data mining can be used to increase sales and have a better logistics services, 
this software has a high value added for the generation and analysis logistics cou- 
pled with cultural algorithms responsible for the creation of routes, founded as a 
tool for decision making, based on the data generated daily by mobile sales. This 
is intended to provide the product to the larger population that requires service. 

Similarly, using Bin Packing algorithm optimizes the space within the distribu- 
tion units and thus provide the company a way to optimize new and creative 
through the use of this heuristic. 

It concludes with the work that has been done for this kind of logistics in terms 
of logistics service and space optimization of delivery vehicles is satisfactory and 
allows the tool to see implemented are good choices. These tools have a high 
added value because it had not previously been used for this purpose, this further 
if we consider that until recently there was no practical implementation of these 
algorithms. 
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Abstract. In this paper a hand gesture recognition method using Artificial Neural 
Networks (ANN) is presented, to evaluate this approach the three-axis acceler- 
ometer found in the Wiimote controller was used to generate a dataset of hand ges- 
tures of certain geometric shapes and letters. The gesture recognition process and 
its evaluation are discussed. 



1 Introduction 

A simple gesture like pointing is an effective method for most people to 
communicate with each other, even in the presence of language barriers [2]. This 
natural way of communication can also be an effective alternative to users who 
cannot or don't want to use traditional input devices like the keyboard, mouse or 
controllers for computer interaction. A gesture as defined by Kendon [5] is a form 
of non-verbal communication in which visible bodily actions communicate 
particular messages, either in place of speech or together and in parallel with 
spoken words. Gestures include movement of the hands, face, or other parts of the 
body. Gesture recognition research focuses on interpreting human gestures. Face 
and hand gesture recognition have received more attention, but also posture, gait, 
proxemics, and other human behaviors are also of interest [12]. 

In this paper we present an ANN approach to hand gesture recognition. The 
hand gestures where acquired using the controller of the Nintendo Wii [9] video 
game console, this is a common approach as discussed in [8]. As an experiment 
several users where asked to perform certain hand gestures using the controller 
and samples of the outputs of accelerometer where recorded for each gesture. This 
data was used to train an artificial neural network to recognize each gesture. This 
work is presented as follows: Section 2 contains a brief explanation of the device 
used, in section 3 related works is discussed, in section 4 the description of the 
problem and the approach is presented, section 5 presents an evaluation and in 
Section 6 conclusions drawn and future work are presented. 

O. Castillo et al. (Eds.): Soft Computing for Intell. Control and Mob. Robot., SCI 318, pp. 67 |-77.| 
springerlink.com © Springer-Verlag Berlin Heidelberg 2010 
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2 Wii Remote 

The Wii Remote, sometimes unofficially nicknamed Wiimote, is the primary 
controller for Nintendo's Wii console [9]. A main feature of the Wii Remote is its 
motion sensing capability, which allows the user to interact with and manipulate 
items on screen via pointing and gesture recognition, through the use infrared (IR) 
tracking technology and and a three-axial accelerometer. The Wii Remote has the 
ability to sense acceleration along three axes through the use of an ADXL330 
accelerometer [19]. The readings recived are the positive or negative acceleration 
in three axes a x , a y , a z , with a minimum full-scale range of ±3 g and also Roll, 
Pitch and Yaw angular orientation in degrees, a schematic of these reafings is 
shown in Figure 1 . 







Fig. 1. The Wiimote acceleration and orientation readings [8] 



These reading can be send through a Bluetooth [1] connection and there are 
several libraries to acquire data from this device [17] [18]. For this work the C li- 
brary wiiuse[6] was selected, this library has support for motion sensing, IR track- 
ing, and also other Nintendo controllers. The accelerometer is very sensitive and 
can produce significant noise, to alleviate this; the wiiuse library generates a read- 
ing (an event) only if certain threshold is reached. IR tracking can be used to esti- 
mate the position of the controller, but has the disadvantage of requiring additional 
devices and users need to point the controller in certain direction. Using just the 
accelerometer readings to estimate the position of the controller is a difficult task, 
in this approach the hand gesture is not interpreted as the geometric shape, instead 
the pattern of acceleration changes sampled as the user is drawing the gesture are 
recognized. 
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3 Related Work 

Gesture recognition can be conducted using different techniques and devices from 
computer vision to image processing, next some works on sensor based gesture 
recognition are presented. William Freeman and Craig Weissman [4] have worked 
on virtual controllers for systems "the act of finding or acquiring a physical 
controller could require too much time, gestures can be used as an alternative 
control mechanism". Also research in robot assisted patient rehabilitation is 
carried out using different sensors (accelerometers and gyros) worn on the body of 
a patient [2,16,11,15]. Also C Dinh, D Tantinger, M Struck [3] have worked on a 
wearable sensor based fall detection system for the elder. Tong Zhang, Jue Wang, 
Ping Liu and Jing Hou [20], use a cell phone to detect falls. Parera, Cecilio 
Angulo [10] proposed the use of accelerometers to identify daily activities. 

WiiRemote based hand gesture recognition recently has gained attention, Jozef 
Mitch [8] presents an approach using Hidden Markov Models, Jun Ki Lee [7] have 
worked on a "Gesture Recognition on Avatar Control System: A Puppeteering 
System for the Huggable". Thomas Schlomer and Benjamin Poppinga [14], aim at 
recognizing gestures to interact with an application in their work they present the 
design and evaluation of sensor-based gesture recognition using Hidden Markov 
Model and a Bayesian approach; in their experiments they used 7 users making 15 
gestures for each of their figures, their results have an average of 90% of accuracy 
using a leave-one-out test. 

4 Problem Description and Strategy 

As an experiment five gestures where defined: Circle, Square, Triangle, and the 
letters S and Z. Although these are two-dimensional shapes, they are expressed by 
users as a hand gesture in a three dimensional space, some of these gestures are 
also used in work done by Thomas Schlomer [14] and Jozef Mitch [8]. Gestures 
that involve orientation readings (Roll, Pitch, Yaw) where not considered. 





1. -Circle 



2.-Square 3-Letter "S" 4.-Triangle 5.-Letter"Z" 

Fig. 2. Gestures made by people to generate the dataset 



4.1 Dataset 

To read each gesture a Python script program was implemented using the wiiuse 
library. Only the three-axial acceleration was considered for a feature vector of the 
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gestures, Figure 3 shows an example a plot of the samples for the Circle gesture. 
The plot consists of 50 samples of the acceleration reported by the accelerometer; 
each sample is defined as a triplet (a x , a y , a z ) with acceleration in axes X, Y and Z. 
Samples were gathered by constantly polling the wiiuse library for events, an 
event is triggered when a significant change in acceleration in any of the axes is 
encountered. Because of this sampling method, the number of samples for each 
gesture is not constant, not even for the same gesture and the same user. Also al- 
though samples are ordered in time, they cannot be taken in constant intervals of 
time. Time was not a variable considered in these experiments. Each gesture is re- 
corded as a variable-length list of triplets (a x , a y , a x ). 
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Fig. 3. Acceleration samples for the Circle gesture 



To generate the dataset, three users were asked to draw in the air the gestures 
shown in Figure 2, starting at the position where the arrow begins and finishing at 
the same point. To indicate when each gesture started and ended, the users kept a 
button pressed as they draw the hand gesture. All users were right-handed, with no 
disabilities and had used the WiiRemote before. Each gesture was repeated ten 
times by each user. The average number of samples triggered by each user is 
shown in Figure 4. For recognition purposes a single feature vector was generated 
for each gesture. Observing the average samples generated by users, a vector with 
a length of 100 samples was defined. If a gesture is captured with fewer samples, 
as is usually the case, the remaining samples are filled with zeros, and if the ges- 
ture generates more samples, those samples are ignored. The length of the vector 



Accelerometer-Based Hand Gesture Recognition Using Artificial Neural Networks 71 

depends on the type of gestures captured, for the simple gestures defined for this 
experiment, 100 samples were adequate. Figure 5 shows the acceleration samples 
for 4 hand gestures, made by the same user, it can be observed that there are dif- 
ferences in the pattern and number of samples. 
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Fig. 4. Average number of samples in gestures made by each user 
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Fig. 5. Examples of the samples of 4 of the gestures used 
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The samples of each axis were concatenated resulting in a feature vector of 300 
samples, an example is shown in Figure 6. The first 100 samples are from a x , the 
next 100 from a y and the rest from a z . The final dataset consists of 150 feature 
vectors, ten for each gesture for three users. 



1.5 r 



0.5 



Concatenated Vector of all Axes 
Circle 






100 



150 

Samples 



200 



250 



300 



Fig. 6. Example of the resulting feature vector after the concatenation of the samples taken 
from each axis 



Figure 7 shows the feature vectors of the Circle gesture made by each user. Al- 
though is the same gesture, differences between users are noticeable, but also ges- 
tures made by the same user are similar. 
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Fig. 7a. Feature vectors for the Circle gesture, 10 by each user. (Up view) 
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7b. Feature vectors for the Circle gesture, 10 by each user. (3D view) 
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4.2 ANN Architecture 

Neural networks have been used successfully in other studies of pattern recogni- 
tion like biometric measurements or predictions of time series, to recognize each 
pattern may have different optimal architecture for this case is proposed an archi- 
tecture shown in Figure 8. 
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Fig. 8. Proposed Feed Forward Neural Network 

This architecture has 300 nuerons entries, 50 neurons in the hidden layer and 5 
outputs but can be change for including or using fewer gestures. 



4.3 Implementation 

This neural network is implemented in Python using the PyBrain library [13]. The 
neural network used is a backpropagation classifier with 0.001 of learning rate, 0.3 
of momentum and trained by 5000 epochs. 

The recognition process can be summarized in these steps: 

1 . Data is acquired from the Wiimote via Bluetooth, by a Python script. 

2. Variable length vectors where exported to MatLab to create and con- 
catenate the feature vectors of each gesture. Several plots can be gen- 
erated from the feature vectors. 

3. Then a matrix from MatLab is exported to a textfile. 
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4. A Python script reads the textfile and the Neural Network is trained 
and evaluated. 



5 Evaluation 

5.1 First Experiment 

In this experiment, the entire feature vectors was used, 24 gestures of each figure 
for training and the remaining 6 gestures for testing. The network had a hidden 
layer of 50 neurons and was trained for 5000 epochs with a learning rate of 0.001. 
The experiment was repeated 10 times with an average of 83.33% Accuracy. In 
Table 1 the confusion matrix of this experiment is shown. 

Table 1. Confusion matrix of all gestures 
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5.2 Second Experiment 

In this experiment, the two gestures with false results in experiment 1 were used, 
24 gestures of each figure for training and 6 for testing. The network was trained 
with a hidden layer of 20 neurons with 5000 epochs and 0.001 of learning rate, 
with an average of 91.6% accuracy in ten runs. In Table 2 the confusion matrix of 
this experiment is shown. 

Table 2. Confusion matrix of Circle and Triangle Gestures 





Circle 


Triangle 


Circle 


5 


1 


Triangle 





6 



6 Conclusions 



In this paper a present a hand gesture recognition method using Artificial Neural 
Networks (ANN) is presented. Some preliminary results showed that neural net- 
works are viable tool for hand gesture recognition. 
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As future work, the dataset of the gestures needs to be extended, with more us- 
ers and more figures in order to produce more valid experiments and results. Also 
more features can be proposed to achieve higher accuracy. 
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Abstract. This paper applied a new Kalman Filter Recurrent Neural Network 
(KFRNN) topology and a recursive Levenberg-Marquardt (L-M) learning algo- 
rithm capable to estimate parameters and states of highly nonlinear unknown plant 
in noisy environment. The proposed KFRNN identifier, learned by the Back- 
propagation and L-M learning algorithm, was incorporated in a direct and indirect 
adaptive neural control schemes. The proposed control schemes were applied for 
real-time recurrent neural identification and control of a continuous stirred tank 
bioreactor model, where fast convergence, noise filtering and low mean squared 
error of reference tracking were achieved. 

Keywords: Backpropagation learning, continuous stirred tank bioreactor model, 
direct and indirect adaptive neural control, Kalman filter recurrent neural identifi- 
cation model, Levenberg-Marquardt recursive learning, recurrent trainable neural 
network controller, sliding mode control. 



1 Introduction 

The universal approximation abilities of the artificial neural networks to approxi- 
mate complex non-linear relationships without prior knowledge of the model 
structure, makes them a very attractive alternative to the classical modeling and 
control techniques [1], [2], [3]. This property has been proved by the universal ap- 
proximation theorem, [3]. Among several possible network architectures the ones 
most widely used are the Feedforward (FFNN) and the Recurrent Neural Net- 
works (RNN). In a feed-forward neural network the signals are transmitted only in 
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one direction, starting from the input layer, subsequently through the hidden layers 
to the output layer, which requires applying a tap delayed global feedbacks and a 
tap delayed inputs to achieve a nonlinear autoregressive moving average neural 
dynamic plant model. A recurrent neural network has local feedback connections 
to some of the previous layers. Such a structure is suitable alternative to the first 
one when the task is to model dynamic systems, and the universal approximation 
theorem has been proved for the recurrent neural networks too. Several RNN 
models applications has been described in [4]-[10]. The preferences given to re- 
current neural network identification with respect to the classical methods of proc- 
ess identification are clearly demonstrated in the solution of the "bias-variance di- 
lemma", [3]. Furthermore, the derivation of an analytical plant model, the 
parameterization of that model and the Least Square solution for the unknown pa- 
rameters have the following disadvantages: (a) the analytical model did not in- 
clude all factors having influence to the process behavior; (b) the analytical model 
is derived taking into account some simplifying suppositions which not ever 
match; (c) the analytical model did not described all plant nonlinearities, time lags 
and time delays belonging to the process in hand; (d) the analytical model did not 
include all process and measurement noises which are sensor and actuator de- 
pendent. In (Sage, [11]) the method of invariant imbedding has been described. 
This method seemed to be a universal tool for simultaneous state and parameter 
estimation of nonlinear plants but it suffer for the same drawbacks because a com- 
plete nonlinear plant model description is needed. Furthermore, the managing of 
noisy input/output plant data is required to augment the filtering capabilities of the 
identification RNNs, [12]. Driven by these limitations, a new Kalman Filter Re- 
current Neural Network (KFRNN) topology and the recursive Backpropagation 
(BP) learning algorithm in vector-matrix form has been derived [13] and its con- 
vergence has been studied [13], [14]. But the recursive BP algorithm, applied for 
KFRNN learning, is a gradient descent first order learning algorithm which does 
not allow to augment the precision and accelerate the learning [12], [14]. There- 
fore, the aim of this paper was to use a second order learning algorithm for the 
KFRNN, as the Levenberg-Marquardt (L-M) algorithm is, [15]. The KFRNN with 
L-M learning was applied for Continuous Stirred Tank Reactor (CSTR) model 
identification [16], [17]. The application of KFRNNs together with the recursive 
L-M could prevent all the problems caused by the use of the FFNN, thus improv- 
ing the learning and the precision of the plant state and parameter estimation in 
presence of noise. Here, the parameters and states, obtained from the KFRNN 
identifier will be used to design a Direct and Indirect Adaptive Neural Control 
(DANC and IANC) of CSTR bioprocess plant model. 

2 Kalman Filter RNN 

This section is dedicated to the KFRNN topology, the recursive Backpropagation 
and the recursive Levenberg-Marquardt algorithms for the KFRNN learning. The 
KFRNN is applied as a state and parameter estimator of nonlinear plants. 
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2.1 Topology of the KFRNN 

Let us consider the linearized plant model (1), (2), represented in a state- space 
form: 

X d .(k+1) = A d (k) X d (k) + B d (k) U(k) + 0,(k) (1) 

Y d (k) = C d (k)X d (k) + 2 (k) (2) 

Where: E [.] means mathematical expectation; the process and measurement nois- 
es 0i(.), ©2 (•) are white, with 0i(k), 2 (s) and the initial state X d (ko) independent 
and zero mean for all k, s, with known variances E [X d (k) X d T (k)] = P , E[0i(k) 
0! T (k)] = Q(k) 5 (k-x), E[0 2 (k) 2 T (k)] = R(k) 5 (k-x), where 5 (k-x)=l if k= x, 
and otherwize. The optimal Kalman filter theory is completely described in [4], 
and we would not repeat it here. For us the Kalman Filter (KF) is a full rank opti- 
mal state estimator capable to estimate the system states, to filter the process and 
measurement noises, taking in hand all plant information available like: in- 
put/output plant data, all parameters of the plant model (1), (2), and the given up 
noise and initial state statistics (mean and variance). The basic Kalman filter equa- 
tions for the estimated state and output variables are given by: 

X,(k+1) = A e (k) X e (k) + K e (k) Y d (k) + B d (k) U(k) (3) 

A e (k) = A d (k)-K e (k)C d (k) (4) 

Y e (k) = C d (k)X e (k) (5) 

Where: X e (k) is the estimated state vector with dimension N e ; A e (k) is a (N e x N e ) 
closed-loop KF state matrix; Y e (k) is the estimated plant output vector variable 
with dimension L; K e (k) is the optimal Kalman filter gain matrix with dimension 
(N e x L). This gain matrix is computed applying the optimal Kalman filtering me- 
thodology given in [11]. So, the KF performed noise filtration by means of an op- 
timal closed-loop feedback which has the drawback that the feedback amplified 
the noise components of the error, especially when the feedback gain is high. The 
second drawback is that the KF design needs complete plant parameter and noise 
information, which means that if the plant data are incomplete the process noise 
level is augmented. To overcome this we need to take special measures like to 
augment the filtering capabilities of the KF. The third drawback is that the KF 
could not estimate parameters and states in the same time processing noisy meas- 
urements with unknown noise statistics, and it will be our task. To resolve this 
task we need to derive the topology and the BP learning algorithm of a new recur- 
rent KF-like neural network subject to learning and capable to estimate parameters 
and states in the same time. First of all we could rewrite the equation (3) defining 
a new extended input vector, containing all available input/output information is- 
sued by the plant, and second - we could modify the output equation (5) so to 
convert it to an output noise filter. After that we obtain: 

X,(k+1) = A d (k) X e (k) - K e (k) Y e (k) + B 2 (k) U e (k) (6) 
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B 2 =[B d ;K e ];U e T =[U;Y d ] 


(7) 


Z(k) = C d (k)X e (k) 


(8) 


Y e (k+l) = A 2 Y e (k) + Z(k) 


(9) 



The obtained new KF RNN topology is given on Fig. 1 . 

The first layer of the KFRNN represented the plant model (equations (10)-(13)) 
and the second layer - represented the output noise filtering model (equations 
(14)-(18)). The KF RNN topology is described by the following equations: 

(10) 



X(k+1) = A ; X(k) + BU(k) - DY(k) 
B = [B i; B ];U T =[U i; U 2 ] 



(11) 



U(k) J x(k+l) 



X(k) 



^J 



G[X(k)] 



Z(k) 



-*© 



V(k+1) 



IZ 



V(k) 



& 



F[V(k))] 



Y(k) 



Fig. 1. Block - diagram of the KFRNN topology 



Ai= block-diag (A^), I A u I < 1 

Zj(k) = G[X(k)] 

C=[C 1 ;C ];Z T =[Z 1 ;Z 2 ] 

Vi(k) = CZ(k) 

V(k+1) = Vi(k) + A 2 V(k) 

A 2 = block-diag (A 2l ), IA 2>i I < 1 

Y(k) = F[V(k)] 



(12) 
(13) 
(14) 
(15) 
(16) 
(17) 
(18) 



Where: X, Y, U are vectors of state, output, and augmented input with dimensions 
N, L, (M+l), respectively, Z is an (L+l) -dimensional input of the feedforward 
output layer, where Zi and Ui are the (Nxl) output and (Mxl) input of the hidden 
layer; the constant scalar threshold entries are Z 2 = -1, U 2 = -1, respectively; V is a 
(Lxl) pre-synaptic activity of the output layer; the super-index T means vector 
transpose; Aj, A 2 are (NxN) and (LxL) block-diagonal weight matrices; B and C 
are [Nx(M+l)] and [Lx(N+l)]- augmented weight matrices; B and C are (Nxl) 
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and (Lxl) threshold weights of the hidden and output layers; F[.], G[.] are vector- 
valued tanh(.) or sigmoid(.) -activation functions with corresponding dimensions. 
Here the input vector U and the input matrix B of the KF RNN are augmented so 
to fulfill the specifications (7) and the matrix D corresponded to the feedback gain 
matrix of the KF. The dimension of the state vector of the KF RNN is chosen us- 
ing the simple rule of thumb, which is: N=L+M. 

2.2 BP Learning of the KFRNN 

So the KF RNN topology corresponded functionally to the KF definition (6)-(9) and 
ought to be learnt applying the BP learning algorithm derived using the adjoint KF 
RNN (see Fig.2) based on KF RNN topology applying the diagrammatic method, 
[18]. The BP learning algorithm, expressed in vector - matricial form is as follows: 

(19) 



W(k+1) = W(k) +ti AW(k) + a AW(k-l); IWy I < W 

E(k) = Y d (k) - Y(k); E^k) = F'[Y(k)] E(k) 

F'[Y(k)] = [l-Y 2 (k)] 

AC(k) = E,(k) Z T (k) 

AA 2 (k) = E,(k) V T (k) 

Vec(AA 2 (k)) = E!(k)-X(k) 



(20) 
(21) 
(22) 
(23) 
(24) 
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B T 
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Fig. 2. Block - diagram of the adjoint KFRNN topology 



E 3 (k) = G'[Z(k)] E 2 (k); E 2 (k) = C'(k) E^k) 
G'[Z(k)] = [l-Z 2 (k)] 
AB(k) = E 3 (k) U T (k) 
AD(k) = E 3 (k) Y T (k) 
AAj(k) = E 3 (k) X T (k) 
Vec(AA,(k)) = E 3 (k)°X(k) 



(25) 
(26) 
(27) 
(28) 
(29) 
(30) 
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Where: F'[.], G'[.] are derivatives of the tanh(.) activation functions; W is a gen- 
eral weight, denoting each weight matrix (C, A u A 2 , B, D) in the KF RNN model, 
to be updated; AW (AC, AAi, AA 2 , AB, AD), is the weight correction of W; Y d is 
an L-dimensional output of the approximated plant taken as a reference for KF 
RNN learning; T|, a are learning rate parameters; AC is an weight correction of C; 
AB is an weight correction of B; AD is an weight correction of D, AAi is the 
weight correction of Ai _ AA 2 is the weight correction of A 2 ; the diagonals of the 
matrices A,, A 2 are denoted by Vec (A,(k)), Vec (A 2 (k)), respectively, where 
(24), (30) represented their learning as an element-by-element vector products; E, 
Ei, E 2 , E 3 , are error vectors (see Fig. 2), predicted by the adjoint KF RNN model. 
So, the KF RNN is capable to issue parameter and state estimations for control 
purposes, thanks to the optimization capabilities of the BP learning algorithm, ap- 
plying the "correction for error" delta rule of learning (see Haykin, [3]). The sta- 
bility of the KF RNN model is assured by the activation functions [-1, 1] bounds 
and by the local stability weight bound conditions given by (12), (17). The stabil- 
ity of the KF RNN movement around the optimal weight point has been proved by 
one theorem and the rate of convergence lemma, (see the Ph.D. thesis of Mariaca 
[14]). It is stated below. 

Theorem of stability of the BP KF RNN applied as a plant identifier, [14]. Let 

the KF RNN topology is given by equations (10)-(1 8) (see Fig.l) and the nonlin- 
ear plant model, is as follows: 

X d .(k+l) = G[X d (k),U(k)] (31) 

Y d (k) = F[X d (k)] (32) 

Where: (Y d (.), X d (.), U(.)} are output, state and input variables with dimensions 
L, N d , M, respectively; G(.), F(.) are vector valued nonlinear functions with re- 
spective dimensions. 

Under the assumption of KF RNN identifiability made, the application of the 
BP learning algorithm for C, A b A 2 , B, D, in general vector-matrix form, de- 
scribed by equation (19)-(30), and the learning rates r| (k), a (k) (here they are 
considered as time-dependent and normalized with respect to the error) are de- 
rived using the following Lyapunov function: 

L(k) = Li(k) + L 2 (k) (33) 

Where: L t (k) and L 2 (k) are given by: 

L I (k) = le 2 (k) 
L 2 (k) = tr(W A1(k ,W A T 1(k) )+ tr(WA 2(k )W A T 2(k >tr(W B(k) Wj (k) )+tr(Wc ( MW c T (k) )+tr(WD (k )W D T (k) ) 

Where: 

WAi(k) = Ai(k)-Ai,WA2(k) = A2(k)-A2,WB(k) = B(k)-B*,Wc(k) = C(k)-C*,WD(k) =D(k)-D* 
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are vectors of the estimation error and (Ai,A2,B*,C*,D*) and 
(Ai(k),A2(k),B(k),C(k),D(k)) denote the ideal neural weight and the estimate of neural 
weight at the k-th step, respectively, for each case. 
Then the identification error is bounded, i.e.: 

L(k+l) = Li(k+l) + L 2 (k+l)<0 (34) 

AL(k+l) = L(k+1) - L(k) (35) 

Where the condition for L,(k+1) < is that: 

^ H 

<J] max <- 



UJmax UJmax 

and for L 2 (k+1) < we have: 

AL2(k+l)<-r]max|e(k+l)| 2 -amax|e(k)| 2 +d(k+l) (36) 

Note that rjmax changes adaptively during the learning process of the network and: 

5 

tlmax=max{rii} 

i-i 

Where all: the unmodeled dynamics, the approximation errors and the perturba- 
tions, are represented by the d-term, and the complete proof of that theorem is giv- 
en in [14]. 

2.3 Recursive Levenberg-Marquardt Learning of the KFRNN 

The general recursive L-M algorithm of learning, [12], [14], [15] is given by the 
following equations: 

W(k +1) = W(k) + P(k)VY[W(k)]E[W(k)] (37) 

Y[W(k)]= g[W(k),U(k)] (38) 

E 2 [W{k)] = {Y p (k)-g[W(k),U(k)]} 2 (39) 



DY[W(k)] = — -g\W,U(k)] 

dW 



(40) 

W=W(t) 



Where: W is a general weight matrix (Ai, A 2 , B, C, and D) under modification; P 
is the covariance matrix of the estimated weights updated; DY[.] is an nw- 
dimensional gradient vector; Y is the KFRNN output vector which depends of the 
updated weights and the input; E is an error vector; Yp is the plant output vector, 
which is in fact the target vector. Using the same KFRNN adjoint block diagram 
(see Fig. 2), it was possible to obtain the values of the gradients DY [.] for each 
updated weight, propagating the value D (k) = I through it. Following the block 
diagram of Fig. 2, equation (37) was applied for each element of the weight 
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matrices (A b A 2 , B, C, D) in order to be updated. The corresponding 
components are as follows: 

DY[C lJ (k)] = D ll (k)Z j (k) 

DY[A 2iJ (k)] = D lJ {k)V j {k) 

D u {k) = F' i [Y i (k)] 

DY[A UJ (k)] = D 2i (k)X j (k) 

Dy[B,(fe)] = D 2 , ; (^)f/,(fe) 

DY[D lj (k)] = D 2i (k)Y j (k) 

D 2i (k) = G' i [Z i (k)]C l D li (k) 

Therefore, the Jacobean matrix could be formed as: 

DY[W(k)] = [DYiC^DY^ik^DYiBfi)) 
DY^ik^DYiD^k))] 

The P(k) matrix was computed recursively by the equation: 

P(k) = cr\k){P(k - 1) - P(k - 1). 

Q[W{k)}S- 1 [W(k)}Q. T [W(k)}P{k - 1)} 

Where the S(.), and 0(.) matrices were given as follows: 
S[W(k)} = a{k)Mk) + a T [W{k)]P{k - l)Q[W(k)] 

0.97<a(fc)<l;10 3 <P(0)<10 6 

VY T [W(k)} 
1 

;1(T 4 <p<10- 6 



a T [w{k)] = 





A(fc)- 1 = 


1 




0" 

p 



gradient 

(41) 
(42) 
(43) 
(44) 
(45) 
(46) 
(47) 

(48) 
(49) 

(50) 



(51) 



The matrix Q. (.) had a dimension (nwx2), whereas the second row had only one 
unity element (the others were zero). The position of that element was computed 

by: 
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i = kmod(nw) + \,k > nw (52) 

After this, the given up topology and learning were applied for the CSTR system 
identification. 



3 Recurrent Trainable NN 

This section is dedicated to the topology, the BP and the L-M algorithms of 
RTNN learning. The RTNN could be obtained from the KFRNN removing the 
output local and global feedbacks. The RTNN was used as a feedback/feedforward 
controller. 

3.1 Topology of the RTNN 

The RTNN model and its learning algorithm of dynamic BP-type, together with 
the explanatory figures and stability proofs, are described in [13], [14], so only a 
short description will be given here. The RTNN topology, derived in vector- 
matrix form, was given by the following equations: 

X(k+l) = AX(k) + BU(k) (53) 

B = [B 1 ;B ];U T =[U, T ;U 2 T ] (54) 

A = block-diag (Ai), lAi I < 1 (55) 

Z,(k) = G[X(k)] (56) 

C = [C,;C ];Z T =[Z 1 T ;Z 2 T ] (57) 

V(k) = CZ(k) (58) 

Y(k) = F[V(k)] (59) 

Where: X, Y, U are vectors of state, output, and augmented input with dimensions 
N, L, (M+l), respectively, Z is an (L+l) -dimensional input of the feedforward 
output layer, where Zi and Ui are the (Nxl) output and (Mxl) input of the hidden 
layer; the constant scalar threshold entries are Z 2 = -1, U2 = -1, respectively; V is a 
(Lxl) pre-synaptic activity of the output layer; the super-index T means vector 
transpose; A is (NxN) block-diagonal weight matrix; B and C are [Nx(M+l)] and 
[Lx(N+l)]- augmented weight matrices; B and C are (Nxl) and (Lxl) threshold 
weights of the hidden and output layers; F[.], G[.] are vector-valued tanh(.) or 
sigmoid(.) -activation functions with corresponding dimensions. Equation (55) 
represents the local stability condition imposed on all blocks of A. The dimension 
of the state vector X of the RTNN is chosen using the simple rule of thumb which 
is: N=L+M. 
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3.2 BP Learning of the RTNN 

The same general BP learning rule (19) was used here. Following the same proce- 
dure as for the KFRNN, it was possible to derive the following updates for the 
RTNN weight matrices: 

E(k) = Y d (k) - Y(k); E,(k) = F'[Y(k)] E(k) (60) 

AC(k) = Ei(k) Z T (k) (61) 

E 3 (k) = G'[Z(k)] E 2 (k); E 2 (k) = C T (k) E,(k) (62) 

AB(k) = E 3 (k) U T (k) (63) 

AA(k) = E 3 (k) X T (k) (64) 

Vec(AA(k)) = E 3 (k)°X(k) (65) 

Where A A, A B, A C are weight corrections of the of the learned matrices A, B, 
and C, respectively; E, El, E2, and E3 are error vectors; X is a state vector; F'(-) 
and G'(.) are diagonal Jacobean matrices, whose elements are derivatives of the 
tanh activation functions (see equations (21) and (26)) . Equation (64) represents 
the learning of the full feedback weight matrix of the hidden layer. Equation (65) 
gives the learning solution when this matrix is diagonal vA, which is the present 
case. The initial values of the weight matrices during the learning are chosen as 
arbitrary numbers inside a small range. The stability of the RTNN model used as a 
direct controller is assured by the activation functions [-1, 1] bounds and by the 
local stability weight bound condition given by (55). The stability of the RTNN 
movement around the optimal weight point has been proved by one theorem (see 
thesis of Mariaca [14]). 

Theorem of stability of the BP RTNN applied as a direct system controller, 

[14]. Let the RTNN with Jordan Canonical Structure is given by equations (53)- 
(59) and the nonlinear plant model is given by (31), (32). Under the assumption of 
RTNN identifiability made, the application of the BP learning algorithm for A(.), 
B(.), C(.), in general matricial form, described by equation (19), (63)-(65) without 
momentum term, and the learning rate r\ (k) (here it is considered as time- 
dependent and normalized with respect to the error) are derived using the follow- 
ing Lyapunov function: 

L(k) = L 1 (k) + L 2 (k) (66) 

Where: Li (k) and L 2 (k) are given by: 

L 1 (k) = le 2 (k) 

L 2 (k) = tr(WA, k )Wl (k) ) + tr(WB M W B T (k) ) + tr(Wc( k) W c T (k) ) 



Where 



WA(k) = A(k) - A*,WB(k) = B(k) -B*,Wc(k) = C(k) -C* 
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are vectors of the estimation error and (A*,B*,C*) and (A(k),B(k),C(k>) denote the 
ideal neural weight and the estimate of neural weight at the k-th step, respectively, 
for each case. 

Let us define: y/ max = max* MM &) , and z? max = max* k?( k) II, where 
\//{k) = do(k) J dW(k), and &(k) = dy(k) / du(k) , where W is a vector 
composed by all weights of the RTNN, used as a system controller, and ■ is an 

Euclidean norm in 9t" . 

Then the identification error is bounded, i.e.: 

L(k+l) = Li(k+l) + L 2 (k+l)<0 (67) 

AL(k+ 1 ) = L(k+ 1 ) - L(k) (68) 

Where the condition for L, (k+1) < is that: 

9 

< /7ma; 

and for L 2 (k+1) < 0, we have: 



lAnaxy'max 



ALi(k + 1) < -77max \e(k + if + j3(k + 1) (69) 

Note that rjmax changes adaptively during the learning process of the network and 

3 

r)max=max{r]i} 

i-i 

Where all: the unmodelled dynamics, the approximation errors and the perturba- 
tions, are represented by the P-term, and the complete proof of that theorem and 
the rate of convergence lemma, are given in [14]. 

3.3 Recursive Levenberg-Marquardt Learning of the RTNN 

The general recursive L-M algorithm of learning [12], [14], [15] is given by equa- 
tions (37)-(40), where W is the general weight matrix (A, B, C) under modifica- 
tion; Y is the RTNN output vector; E is an error vector; Yp is the plant output vec- 
tor. Using the RTNN adjoint block - diagram [12], it was possible to obtain the 
values of DY [.] for each updated weight propagating D=I. Applying equation (40) 
for each element of the weight matrices (A, B, C), the corresponding gradient 
components are obtained as: 

DY[C iJ (k)] = D li (k)Z J {k) (70) 

D lJ (k) = F' i [Y i (k)] (71) 
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DY[A ij (k)] = D 2i (k)X j (k) (72) 

DY[B iJ (k)] = D 2i (k)U J (k) (73) 

D 2i (k) = G' i [Z i (k)]C i D li (k) (74) 

Therefore the Jacobean matrix could be formed as: 

DY\W(k)] = [DY(C tj (k)), DF(4 (k)), DY{B t] (k))] (75) 

The P (k) matrix was computed recursively by equations (49)-(52). Next, the giv- 
en up RTNN topology and learning were applied for CSTR system control. 

4 Adaptive Control Systems Design 

This section is dedicated to the design of direct and indirect (sliding mode) adap- 
tive control system using the KF RNN as a nonlinear plant identifier. The RTNN 
was used as a feedback/feedforward controller in the case of direct adaptive neural 
control. 

4.1 Direct Adaptive Neural Control Scheme 

This section describes the direct adaptive control using KFRNN as plant identifier 
and RTNN as a plant controller (feedback / feedforward). The block-diagram of 
the control system is given in Fig. 3. The following study describes the linearized 
model of that closed-loop control system. 

Let us present the following z-transfer function representations of the plant, the 
state estimation part of the KFRNN, and the feedback and feedforward parts of the 
RTNN controller: 

W p (z) = C p (zI-A p y 1 B p (76) 

P i {z)=(zI-A i y l B i (77) 

Q l (z) = C c (zI-Aj'B lc (78) 

Q 2 {z) = C c {zI-Aj l B 2c (79) 

The control systems z-transfer functions (76)-(79) are connected by the following 
equation, which is derived from Fig. 3, and is given in z-operational form: 

Y p (z) = W p (z)[l + Q l (z)P i (z)Y 1 Q 2 (z)R(z) + 0(z) (80) 

0(z)=W p (z)e i (z) + e 2 (z) (81) 
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Where: 9 (z) represents a generalized noise term. The RTNN and the KFRNN to- 
pologies were controllable and observable, and the BP algorithm of learning was 
convergent, [12], [14], so the identification and control errors tended to zero: 
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Fig. 3. Block - diagram of the closed-loop neural control system 



£,(A) = r p (*)-y(*)-»0;*-»oo 



E c (k) = R(k)-Y p (k)^0; k 



> CO 



(82) 



(83) 



This means that each transfer function given by equations (76)-(79) was stable 
with minimum phase. The closed-loop system was stable and the feedback dy- 
namical part of the RTNN controller compensated the plant dynamics. The feed- 
forward dynamical part of the RTNN controller was an inverse dynamics of the 
closed-loop system one, which assured a precise reference tracking in spite of the 
presence of process and measurement noises. 

4.2 Indirect Adaptive Control Scheme (Sliding Mode Control) 

The indirect adaptive control using the RTNN as plant identifier has been de- 
scribed in [12]. Later the proposed indirect control has been derived as a Sliding 
Mode Control (SMC) and applied for control of unknown hydrocarbon biodegra- 
dation processes, [13], using the KF RNN identifier with BP learning. Here we 
applied the KF RNN identifier with L-M learning. The block diagram of the indi- 
rect adaptive control scheme is shown on Fig. 4. It contains identification and state 
estimation KF RNN and a sliding mode controller. The stable nonlinear plant is 
identified by a KF RNN model with topology, given by equations (10)-(18) 
learned by the stable BP-learning algorithm, given by equations (19)-(30), or us- 
ing the second order LM-learning algorithm, given by equations (37)-(52). The 
simplification and linearization of the neural identifier equations (10)-(18), omit- 
ting the DY (.) term, leads to the next local linear plant model, extracted from the 
complete KF RNN model: 
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X(k + l) = A 1 X(k) + BU{h) 
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(84) 
(85) 



Z(k) = HX(k); H = CG , (Z) 
Where G'(-) is the derivative of the activation function and L = M, is supposed 
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Fig. 4. Block - diagram of the closed-loop system containing KF RNN identifier and a SMC 

In [19], the sliding surface is defined with respect to the state variables, and the 
SMC objective is to move the states form an arbitrary space position to the sliding 
surface in finite time. 

In [20], the sliding surface is also defined with respect to the states but the 
states of the SISO systems are obtained from the plant outputs by differentiation. 
In [21], the sliding surface definition and the control objectives are the same. The 
equivalent control systems design is done with respect to the plant output, but the 
reachability of the stable output control depended on the plant structure. 

In [13], the sliding surface is derived directly with respect to the plant outputs 
which facilitated the equivalent SMC systems design. Let us define the following 
sliding surface equation as an output tracking error function: 



S(k + l) = E(k + l) + Y j y.E(k-i + l); M <1 



(86) 



Where: S(.) is the Sliding Surface Error Function (SSEF) defined with respect to 
the plant output; E(.) is the systems output tracking error; y are parameters of the 
desired stable SSEF; p is the order of the SSEF. The tracking error in two con- 
secutive moments of time is defined as: 



E(k)=R(k)-Z(k);E(k + l)=R(k + l)-Z(k + l) 



(87) 



Where: R (k), Z (k) there are L-dimensional reference and output vectors of the 
local linear plant model. The objective of the sliding mode control systems design 
is to find a control action which maintains the systems error on the sliding surface 
which assure that the output tracking error reaches zero in P steps, where P < N. 
So, the control objective is fulfilled if: 
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S(k + l) = 



(88) 



Now, let us to iterate (85) and to substitute (84) in it so to obtain the input/output 
local linear plant model, which yields: 



Z(k + l) = FX(k + l) = F\AX(k) + BU(k)~\ 

From (86)-(87), and (89) it is easy to obtain: 

p 
R (k + 1) - Z (fc + 1) + £ YE (k - i + 1) = 

The substitution of (89) in (90) gives: 

p 
R{k + l)-FAX(k)-FBU{k) + Y J 7>E{k-i + l) = 



(89) 



(90) 



(91) 



As the local approximation plant model (84), (85), is controllable, observable and 
stable (see [13], [14]), the matrix A t is diagonal, and L = M, then the matrix prod- 
uct (HB), representing the plant model static gain, is nonsingular, and the plant 
states X(k) are smooth non-increasing functions. Now, from (91) it is easy to ob- 
tain the equivalent control capable to lead the system to the sliding surface which 
yields: 



U (k) = (FBY 1 



-FAX (k) + R(k + l) + Y j y ; E(k-i + l) 



(92) 



Following [19], the SMC avoiding chattering is taken using a saturation function 
instead of sign one. So the SMC takes the form: 



U\k) 



U (k) if \\U (k)\\ < U n 

eg \ I || eq \ /|| 

-UU (k)/\\u (fc)| if \\U (k)\\>U n 

eq \ ) I \\ eq \ / eq \ / 



(93) 



The SMC substituted the multi-input multi-output coupled high order dynamics of 
the linearized plant with desired decoupled low order one. 



5 Description of the CSTR Bioprocess Plant 

The CSTR model given in [16], [17] was chosen as an example of RNN applica- 
tions in system identification and control of biotechnological plants. Numerical 
values for the parameters and nominal operating conditions of this model are 
given in Table 1 . 
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The CSTR is described by the following continuous time nonlinear system of 
ordinary differential equations: 

dC A (t) o 



dt 



V 



(C Af -C A (t))- 



-koC A (t)exp 



E 



RT(t) 



(94) 



Table 1. Parameters and operating conditions of the CSTR 



Parameters 



Parameters 



Parameters 



Q = 100 (L 1 min) 


E J R = 9.95 x 10 :! (K) 


C AJ =1.0 (mol/L) 


-AH = 2xlO i (cal/mol) 


T f = T fc = 350 (K) 


pp c = 1000 (g/L) 


V = 100 (L) 


CC ec =l (cal/gK) 



Q e0 = 103.41 (L J min) 
hA = 7x10' (cal/mmK) 

T Q = 440.2 (K) 
k, = 7.2 X 10 10 (1 / min) 



dt V ■' pC v \RTit)) 



PC 

pc p v 



QM 



l-exp 



-hA 



K QM)pc pi:J 



(t,-T{t)) 



(95) 



In this model it is enough to know that within the CSTR, two chemicals are mixed 
and that they react in order to produce a product compound A at a concentration 
CaW, and that the temperature of the mixture is T(t). The reaction is exothermic 
and it produces heat which slows down the reaction. By introducing a coolant 
flow-rate Q c (t), the temperature can be varied and hence the product concentration 
can be controlled. Here CAt is the inlet feed concentration; Q is the process flow- 
rate; Tf and T ef are the inlet feed and coolant temperatures, respectively. All this 
variables are assumed constant at nominal values. Likewise, kg, E/R, V, AH, p, 
C pc , C p , and p c are thermodynamic and chemical constants related to this particu- 
lar problem. The quantities Q c0 , T , and C A o, shown in Table 1, are steady values 
for a steady operating point in the CSTR. The objective was to control the product 
compound A by manipulating Q c (t). The operating values were taken from [16] 
and [17], where the performance of a NN control system is reported. 



6 Simulation Results 



Some simulation results of the CSTR biotechnological plant neural identification 
and control are summarized in this part. 
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6.1 Simulation Results of Bioprocess Plant Neural Identification 

Results of detailed comparative graphical simulation of CSTR KFRNN plant iden- 
tification by means of the BP and the L-M learning are given in Fig. 5 and Fig. 6. A 
10% white noise with different variance (SEED parameter of MATLAB) for each 
run was added to the plant inputs and outputs and the behavior of the plant identi- 
fication was studied accumulating some statistics of the final MSE% (^ av ) for 
KFRNN BP and L-M learning. The results for 20 runs are given in Tables 3 and 4. 




5000 



Fig. 5. Graphical results of identification using BP KFRNN learning, a) Comparison of the 
plant output (continuous line) and KFRNN output (pointed line); b) state variables; c) com- 
parison of the plant output (continuous line) and KFRNN output (pointed line) in the first 
instants; d) MSE% of identification 




5000 



Fig. 6. Graphical results of identification using L-M KFRNN learning, a) Comparison of the 
plant output (continuous line) and KFRNN output (pointed line); b) state variables; c) com- 
parison of the plant output and KFRNN output in the first instants; d) MSE% of identification 
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The mean average cost for all runs (e) of KFRNN plant identification, the stan- 
dard deviation (o) with respect to the mean value, and the deviation (A) are pre- 
sented in Table 2 for the BP and L-M algorithms. They were computed by the 
formulas: 



1 " (T 



IL, <7 = .-IA ) ,A=Z av -e (96) 

nk=i k Vn'=i 



Table 2. Standard deviations and mean average values of identification validation using the 
BP and L-M algorithms of KF RNN learning 



BP algorithm L-M algorithm 

s =0.9457 s =0.8264 

a =0.0416 a =0.0188 



Table 3. MSE% of 20 runs of the identification program using the KFRNN BP algorithm 



No 


1 


2 


3 


4 


5 


MSE% 


0.9559 


0.9654 


0.8821 


0.9614 


0.8798 


No 


6 


7 


8 


9 


10 


MSE% 


0.9444 


0.9591 


0.9700 


0.9685 


1.0034 


No 


11 


12 


13 


14 


15 


MSE% 


0.8523 


0.8105 


0.9863 


0.9038 


1.0122 


No 


16 


17 


18 


19 


20 


MSE% 


0.9688 


0.8630 


0.8624 


0.8521 


0.8898 



Table 4. MSE% of 20 runs of the identification program using the KFRNN L-M algorithm 



No 


1 


2 


3 


4 


5 


MSE% 


0.8123 


0.8001 


0.8553 


0.8360 


0.8149 


No 


6 


7 


8 


9 


10 


MSE% 


0.8072 


0.8072 


0.8285 


0.8236 


0.8037 


No 


11 


12 


13 


14 


15 


MSE% 


0.8659 


0.8105 


0.8269 


0.8218 


0.8118 


No 


16 


17 


18 


19 


20 


MSE% 


0.8628 


0.8226 


0.8514 


0.8288 


0.8280 



The numerical results given in Tables 2, 3, and 4 are illustrated by the 
bar-graphics in Figures 7 a, b. 
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0.5 



20 




Fig. 7. Comparison between the final MSE% for 20 runs of the identification program: a) 
using BP algorithm of learning, b) using L-M algorithm of learning 



The comparative results showed inferior MSE%, e, and o for the L-M algorithm 
with respect to the BP one. 

6.2 Simulation Results of Bioprocess Plant Adaptive Neural Control 

The graphical simulation results of DANC using the L-M algorithm of learning are 
presented in Fig. 8 where the final MSE% was 0.854% for the L-M algorithm of 
learning. Similar results are given on Fig.9 for the indirect SMC control. The final 
value of the MSE% obtained for the indirect SMC using the L-M algorithm of learn- 
ing for the KFRNN identifier is of 0.434%. The graphical results and the obtained 
final MSE% showed that the indirect SMC control is about twice times more precise 
that the DANC due to the utilization of the estimated states and parameters in 




;::: 
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Fig. 8. Detailed graphical simulation results of CSTR plant DANC using L-M learning, a) 
comparison between the plant output and the reference signal; b) comparison between the 
plant output and the reference signal in the first instants; c) control signal; d) MSE% of 
control 
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Fig. 9. Detailed graphical simulation results of CSTR plant Sliding Mode Indirect Control 
using L-M KFRTNN learning, a) comparison between the plant output and the reference 
signal; b) comparison between the plant output and the reference signal in the first instants; 
c) control signal; d) MSE% of control 



that case, and also due to the SMC algorithm of control which substitute the plant 
dynamics by a decoupled lower order one. 



7 Conclusions 



The paper proposed a new KFRNN model for system identification and state esti- 
mation of nonlinear plants. The KFRNN is learnt by the first order BP and by the 
second order L-M recursive learning algorithms. The validating results of system 
identification reported here gave priority of the L-M algorithm of learning over the 
BP one which is paid by augmented complexity. The estimated states and parame- 
ters of the plant, obtained by this Kalman filter recurrent neural network model are 
used for direct and indirect adaptive trajectory tracking control system design. The 
applicability of the proposed neural control system, learnt by the BP and L-M al- 
gorithms, was confirmed by simulation results with a CSTR plant. The results 
showed good convergence of the two algorithms applied. The graphical and nu- 
merical validation identification results showed that the L-M algorithm of learning 
is more precise but more complex then the BP one. The control results of DANC 
and IANC (SMC) showed a great precision of reference tracking (the final MSE% 
is 0.854% for the DANC and 0.434% for the indirect SMC). The better results ob- 
tained with the indirect SMC are due to the utilization of the estimated states and 
parameters in that case, and also due to the SMC algorithm of control which sub- 
stitute the plant dynamics by a decoupled lower order one. 
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Abstract. The number of applications in the industry using PID controllers is big- 
ger than fuzzy controllers. One reason is the problem of the tuning, because it im- 
plies the handling of a great quantity of variables like: the shape, number and 
ranges of the membership functions, the percentage of overlap among them and 
the design of the rule base. The problem is more complicated when it is necessary 
to control multivariable systems due that the number of parameters. The impor- 
tance of the tuning problem implies to obtain fuzzy system that decrease the set- 
tling time of the processes in which it is applied, or in some cases, the settling time 
must be fixed to some specific value. In this paper a very simple algorithm is pre- 
sented for the tuning of a type-2 fuzzy controller using only one variable to adjust 
the performance of the system. The results are based on the relation that exists be- 
tween the shape of the membership functions and the settling time. Some simula- 
tions are presented to illustrate the proposed algorithm. 

1 Introduction 

The implementation of a fuzzy controller is not so complex. The knowledge of the 
dynamics of the system can be used to find a very good estimation of the rules and 
the sets of membership functions. But to find an optimal and well tuned fuzzy con- 
troller is another problem. Normally the methodology for tuning fuzzy controller is 
heuristic work and for every problem it is necessary to consider some elements like: 
the bandwidth, the error in steady state, or the settling time. In some cases it is pos- 
sible to use this information to find the optimal parameters of the controller. In the 
case of a PID controller it is necessary to find three parameters (proportional gain, 
derivative time, and integral parameters). In the case of fuzzy controllers, there are 
many parameters to compute like, number of membership functions used, the ranges 
of every function, the rules, the shape, the percentage of overlap, etc. [1] [2] [3]. 
Many researchers prefer to use the very well known PID controller instead of a 
fuzzy controller due the tuning complexity. Many times this is a very important rea- 
son to avoid the use in the industry of this type of "intelligent" controller. 

The tuning of any type of controller implies the adjustment of the parameters to 
obtain a desired behavior or a good approach with a minimal error of the desired 
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response. The different methods published in the area for the problem of fuzzy 
controllers' tuning use methodologies like evolutionary computation [1] [2] [3] [4] 
and artificial neural networks [5] [6] [7] [8] [9]. These methods search the solution 
according to objective functions, parameter estimation, gradient error, etc., but in 
many cases these alternatives have serious convergence problems and a very com- 
plex mathematical representation. The computation time is big, or it is possible 
that the solution computed is only a local minimum of the general solution. The 
problem is even more complex for type-2 fuzzy logic controllers because there are 
more design parameters to find. 

In this paper a very simple method for tuning type-2 fuzzy controllers is pre- 
sented using only two parameters. In this case, the paper is based in the relation 
between the stabilization time and the range of thetype-2 membership functions. 
This paper uses the results of a previous work [10] using only one parameter for 
tuning all the membership functions. The paper is structured in the following way: 
In section 2 the relationship that exists between the positions of the membership 
functions with the transfer characteristic is presented. In section 3 the non-linear 
system used is described with the controller's description for the tuning. The sec- 
tion 4 outlines an algorithm of parametric tuning that modifies the operation points 
that define the group of membership functions. In the section 5 the results of the 
simulations are shown for different values of the tuning factor and different graph- 
ics that show the behavior of the settling time in function of the tuning factors and 
finally the conclusions of the work. 

2 Type-2 Fuzzy Logic Systems 

If for a type- 1 membership function, as in Figure 1 , we blur it to the left and to the 
right, as illustrated in Figure 2, then a type-2 membership function is obtained. In 
this case, for a specific value x' , the membership function ( u' ), takes on different 
values, which are not all weighted the same, so we can assign an amplitude 
distribution to all of those points. 




Fig. 1. Type-1 membership function 



Simple Tuning of Type-2 Fuzzy Controllers 



105 




Fig. 2. Blurred type-1 membership function 



Doing this for all X G X , we create a three-dimensional membership function 
-a type-2 membership function- that characterizes a type-2 fuzzy set. A type-2 
fuzzy set A , is characterized by the membership function: 



A = {((x,u),jU a (x,u))\Vxe Xyus J x c[0,l]} 
in which < jil-^ (x, u) < 1 . Another expression for A is, 

A=\ \ u~(x,u)l{x,u) J r C[0,1] 

JxeXJueJ, A 



(1) 



(2) 



Where denotes the union over all admissible input variables x and u. For 

discrete universes of discourse is replaced by y, ■ I n f act J ' x C [0,1] 

represents the primary membership of x, and jil^(x,u) is a type-1 fuzzy set 

known as the secondary set. Hence, a type-2 membership grade can be any subset 
in [0,1], the primary membership, and corresponding to each primary membership, 
there is a secondary membership (which can also be in [0,1]) that defines the pos- 
sibilities for the primary membership. Uncertainty is represented by a region, 
which is called the footprint of uncertainty (FOU). When 

jU^(x,u) = 1,VmG J x C [0,1] we have an interval type-2 membership func- 
tion, as shown in Figure 3. The uniform shading for the FOU represents the entire 
interval type-2 fuzzy set and it can be described in terms of an upper membership 

function/?^ (x) and a lower membership function jil , (x) . 
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Fig. 3. Interval type-2 membership function 



A FLS described using at least one type-2 fuzzy set is called a type-2 FLS. 
Type-1 FLSs are unable to directly handle rule uncertainties, because they use 
type-1 fuzzy sets that are certain. On the other hand, type-2 FLSs, are very useful 
in circumstances where it is difficult to determine an exact membership function, 
and there are measurement uncertainties. 

It is known that type-2 fuzzy sets enable modeling and minimizing the effects 
of uncertainties in rule-based FLS. Unfortunately, type-2 fuzzy sets are more dif- 
ficult to use and understand than type-1 fuzzy sets; hence, their use is not wide- 
spread yet. As a justification for the use of type-2 fuzzy sets are mentioned at 
least four sources of uncertainties not considered in type-1 FLSs: 



1 



The meanings of the words that are used in the antecedents and consequents of 
rules can be uncertain (words mean different things to different people). 
Consequents may have histogram of values associated with them, especially 
when knowledge is extracted from a group of experts who do not all agree. 
Measurements that activate a type-1 FLS may be noisy and therefore uncertain. 
The data used to tune the parameters of a type-1 FLS may also be noisy. 



All of these uncertainties translate into uncertainties about fuzzy set membership 
functions. Type-1 fuzzy sets are not able to directly model such uncertainties be- 
cause their membership functions are totally crisp. On the other hand, type-2 
fuzzy sets are able to model such uncertainties because their membership func- 
tions are themselves fuzzy. A type-1 fuzzy set is a special case of a type-2 fuzzy 
set; its secondary membership function is a subset with only one element, unity. 
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A type-2 FLS is again characterized by IF-THEN rules, but its antecedent or 
consequent sets are now of type-2. Type-2 FLSs, can be used when the circum- 
stances are too uncertain to determine exact membership grades such as when the 
training data is corrupted by noise. Similar to a type-1 FLS, a type-2 FLS includes 
a fuzzifier, a rule base, fuzzy inference engine, and an output processor, as we can 
see in Fig. 4. The output processor includes type-reducer and defuzzifier; it gen- 
erates a type-1 fuzzy set output (from the type-reducer) or a crisp number (from 
the defuzzifier). Now we will explain each of the blocks of Figure 4. 



CRISP 
INPUT 



xeX 



RULES 



3 FUZZIFIER 



$ INFERENCE 



FUZZY INPUT 
SETS 



OUTPUT 
PROCESSING 



I 



I DEFUZZIFIER 



I J 

TYPE- REDUCER -1 



FUZZY OUTPUT 
SETS 



CRISP 

OUTPUT 
v = f{x) S Y 
> 



TYPE 

REDUCED 

SET 



Fig. 4. Type-2 Fuzzy Logic System 



2.1 Fuzzifier 



The fuzzifier maps a crisp point x=(x b ...,x p ) T e X l xX 2 x...xX p = X into a type-2 
fuzzy set A in X, interval type-2 fuzzy sets in this case. We will use type-2 sin- 
gleton fuzzifier, in a singleton fuzzification, the input fuzzy set has only a single 
point on nonzero membership. A x is a type-2 fuzzy singleton if jil~ (x) = 1/1 

for x=x' and jil~ (x) = 1/0 for all other x#x'. 



2.2 Rules 

The structure of rules in a type-1 FLS and a type-2 FLS is the same, but in the lat- 
ter the antecedents and the consequents will be represented by type-2 fuzzy sets. 
So for a type-2 FLS with/5 inputs x t E X t ,...,x p £l p and one output ye Y, Multi- 
ple Input Single Output (MISO), if we assume there are M rules, the Zth rule in the 
type-2 FLS can be written as follows: 



R : IF jci is P{ and --and x p is F p ' , THEN y is G 
1=1,.. „M 



~i 



(3) 
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2.3 Inference 

In the type-2 FLS, the inference engine combines rules and gives a mapping from 
input type-2 fuzzy sets to output type-2 fuzzy sets. It is necessary to compute the 

join U, (unions) and the meet l~l (intersections), as well as extended sup-star com- 
positions (sup star compositions) of type-2 relations. lfF\X---xF p =A , equa- 
tion (3) can be re-written as 



R' :Fix-xf p ->&=&' ->G' 1=1,.. .M 



(4) 



R l is described by the membership function ju R , (x, y) = jii R , (x x ,..., x p , y) , where 

[l Rl (x,y)=M x i^ G i(x,y) (5) 

can be written as: 

M R i (x, y) = M a i^ g i (x, y) = M F i (*i) n — n Mp > (* p ) n M G > O) 

= [nf =1 //^.(x,.)]n// G ,(y) 



(6) 



In general, the /^-dimensional input to R is given by the type-2 fuzzy set A x whose 
membership function is 



Ma x O) = Mx l (*i) n • ' • n Mxp ( x p ) = n f=i thi ( x i ) 



(7) 



where K i {i = \,...,p)ax& the labels of the fuzzy sets describing the inputs. Each 
rule R 1 determines a type-2 fuzzy set B = A x ° R such that: 

Vb' W = <%«< = U ** K (x) n <"«< (x ' ^J ye y Z= 1 , . . . ,M (8) 

This equation is the input/output relation in Figure 4 between the type-2 fuzzy set 
that activates one rule in the inference engine and the type-2 fuzzy set at the out- 
put of that engine. 

In the FLS we used interval type-2 fuzzy sets and meet under product t-norm, 
so the result of the input and antecedent operations, which are contained in the fir- 
ing set I"! f-iJUp (x i = F (x') , is an interval type-1 set, 



F'(X'): 



/ ' («'),/ (x') 



/ ,/ 



(9) 



Where 



f'(x')= M (x l )*••■*// (x ) 



(10) 
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And 

_/ 

/ (x')=ju Fil (xO*-*]Up p >(x p ) < n ) 

where * is the product operation. 

2.4 Type Reducer 

The type-reducer generates a type-1 fuzzy set output, which is then converted in a 
crisp output through the defuzzifier. This type-1 fuzzy set is also an interval set, 
for the case of our FLS we used center of sets (cos) type reduction, F cos which is 
expressed as: 

Fcos(*) = [:v/.:>g=f v . e , v . v .,-f v « sr v « v «,f -. -( -m 1/ 4^ (12) 

J ydn.y,] J > =[» .-v ]J /le[/lj , J / « E[/ « J , V" /'' 

this interval set is determined by its two end points, v/ and y r , which corresponds 
to the centroid of the type-2 interval consequent set G , 



C & 



Li J' 



before the computation of F cos (x), we must evaluate equation (13), and its two 
end points, v; and y r . If the values of/ and v, that are associated with y; are de- 
noted// and yl, respectively, and the values of/- and y,- that are associated with y r 
are denoted/' and y r \ respectively, from equation (13), we have 



Z M f i i 

LiJi 



iJr 



y r = ~±L . d5) 



The values of yi and y r define the output interval of the type-2 fuzzy system, which 
can be used to verify if training or testing data are contained in the output of the 
fuzzy system. We will consider this measure of covering the data as one of the de- 
sign criteria in finding an optimal interval type-2 FS. The other criteria, is that the 
length of this output interval should be as small as possible to be optimal. 



110 



E. Gomez-Ramirez, P. Melin, and O. Castillo 



2.5 Defuzzifier 



From the type -reducer we obtain an interval set Y coli , to defuzzify it we use the av- 
erage of v; and y r , so the defuzzified output of an interval singleton type-2 FLS is 



y(x): 



yi + y r 



(16) 



In this paper we will be more interested on the left and right bounds of the type- 
reduction as we will be checking if the output interval is covering the training or 
testing data. In this respect deffuzification is not as relevant. 



3 Type-2 Fuzzy Controller Characterization 

The fuzzy controller's behavior, considering its output speed, sensitivity and reac- 
tion under disturbances can be described using the transfer characteristic and the 
position of the operation points [10]. This is related with the choice of the fuzzy 
controller's gain dy I dx (where y is the output and x is the input of the system), 
in different regions of the domain x. 



Case 1. For a flat slope in the middle of the domain x and increasing slopes toward 
increasing \x\ values, choose larger distances between operations points in the 
middle of domain (see figure 5). This means: 



For 



x 2 I 



>\x 1 \=>\dy/dx\ > \dy I dx\ x 



(1) 



Case 2. For a steep slope in the middle of the domain x and decreasing slopes to- 
ward increasing Ixl values, choose smaller distances between operations points in 
the middle of domain (see figure 6). This means: 



For 



> I JC, 



I dy I dx I 



< 



\dy I dx \ x 



(2) 




Fig. 5. Relationship between the location of the membership functions and the transfer 
characteristic for the case 1 
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Fig. 6. Relationship between the location of the membership functions and the transfer 
characteristic for the case 2 



Option 1 should be chosen if for small errors a slow reaction to disturbances of the 
system under control is required. Option 2 should be chosen if for small errors the 
system is supposed to be sensitive with respect to disturbances. 

In the previous figures the values for the intervals of the membership function 
are important for the slopes and the speed of the controller response. If the mem- 
bership functions "expand" (figure 5) then the response is slower than a compress 
group of membership functions (figure 6). 

4 Dynamic System and Fuzzy Controller 

For the analysis and simulations with the tuning algorithm a second order system 
has been considered: 

1 (3) 



0.45s 2 +2.5 + 1 

overdamped with a damping ratio E, = 1 .4907 and a natural frequency 
co n =1.4907 radls. 

The fuzzy controller designed for the control of the plant described previously 
is a system TISO (two inputs-one output) where the inputs are the error and the 
change of error while the output is the control action. Each one of the controller's 
variables has been divided in 5 fuzzy regions. The fuzzy associative memory, 
integrated by 25 rules, it is shown in the Figure 8 and the surface in Figure 7. 



Table 1. Controller Fuzzy variables 





Input 


variables 




Output variable 


error 


change of error 


control action 


GN: 


Big negative 


GN: 


Big negative 


DG: Big diminution 


MN: 


Medium negative 


MN: 


Medium negative 


DP: Small diminution 


Z: 


Zero 


Z: 


Zero 


M: Hold 


MP: 


Medium positive 


MP: 


Medium positive 


AP: Small increase 


GP: 


Big positive 


GP: 


Big positive 


AG: Big increase 
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Fig. 7. Example of control surface of a fuzzy controller 
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Fig. 8. Fuzzy associative memory for the control system 

The membership functions were defined in triangular shape for the middle and 
in a trapezoidal shape in the extremes; such that always have it overlap in the 
grade of membership jil(x) = 0.5 . These membership functions will be consid- 
ered later as the initial conditions for the proposed algorithm. The control surface 
for the fuzzy controller under its initial conditions is similar to Figure 9. 




Fig. 9. Control surface for the fuzzy controller with the membership functions under their 
initial conditions 
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The output of the system, with a step input of amplitude 40, is shown in figure 10. 
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Fig. 10. Output of the system with the membership functions under their initial conditions 



4.1 Tuning Algorithm 

The objective of the tuning algorithm is to be able to manipulate, by means of a 
single variable and in a simple way, the settling time of the system, from the an- 
swer without controller until the response equivalent to 1/5 of the settling time of 
the answer without controller. The response must be fulfilled too with the con- 
straints of small overshoots and without persistent oscillations, which means, a 
very smooth response. 

This algorithm is based on the properties of the transfer characteristic or, in this 
case, of the control surface that it allows to modify the controller's behavior by means 
of modifications in the position and support of the membership functions maintaining 
fixed the fuzzy controller's structure. Obtaining a slower answer for configurations 
with wide or expanded membership functions in the center (see fig. 1) and reduced in 
the ends, and the other way, a faster answer for configurations with reduced or com- 
pressed membership functions in the center and wide in the ends (fig. 2). 

The tuning algorithm only modifies the membership functions of the input vari- 
ables and the membership functions of the output fuzzy variable remains constant 
since this disposition is only in function of a proportion of the range of the control 
action, in other words, they always remain uniformly spaced. 

4.2 Tuning Factor Selection 

The tuning factor is a number k € [0, lj that determines the grade of tuning ad- 
justment obtaining for k = the biggest settling time (fig. 1) and for k = 1 the 
smallest settling time (fig. 2). 

4.3 Normalization of the Ranges of the Fuzzy Controller's Variables 

In this step the range of each input fuzzy variable is modified so that their upper 
and lower limits are equal to +1 and -1, respectively. 
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4.4 Tuning Factor Processing 

When the range is normalized in the range between and 1 the values can be 
computing using a power function, because to expand it is only necessary to use 
an exponent less than 1 and to compress an exponent more than 1 (see fig. 12). 
Recall the previous step that all the values belong to the interval [-1,1]. In an ex- 
perimental way different values of exponents were tested (table 2) such that the 
new vector of operation points will be given by: 



Vop final =(Vop initial ) 



r(k) 



(4) 



Where Vop inilia i are the values normalized of the membership function in the x-axis 
and r(k) is a polynomial. 

Table 2. Important values of r(k). 



k 


r 





1/40 


0.5 


1 


1 


3 



r(k) A 




0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1 k 

Fig. 11. Plot of function r(k) 



The initial coefficients of the polynomial were obtained using mean square me- 
thod. The values of k were defined in this way to be able to make an estimate over 

all their range k G [0, lj . The values of r, since it is an exponent, they were de- 
fined considering the increasing or decreasing of a number that is powered to the 
exponent r. Remember that the goal is to expand (slow response) for k=0 and to 
compress (fast response) k=l. For values below r = 1/40 the answer of the system 
was not satisfactory and in the same way, for values more than r = 3. With all 
these elements the polynomial obtained was: 
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r(k) = 



30k 3 +37 k 2 + 52k + l 
40 



(5) 



This r(k) was found testing the optimal response for different dynamical systems 
(linear and non-linear) and finding the optimal parameters of the polynomial that 
fix the function for different values of k (k = 0, 0.5, 1) (figure 11). 

To visualize the effect of this processing it is useful the graph of curves of ad- 
justment for the vectors of operation points (figure 12). The values that can take 
an operation point (positive section) are in the horizontal axis and in the vertical 
axis, the values that takes this operation point once it has been powered to the ex- 
ponent r(k) where k G [0, lj . 




Fig. 12. Curves of adjustment for the operation points 

4.5 Denormalization of the Ranges of the Fuzzy Variables 

In this step it is necessary to convert the normalized range to the previous range of 
the system. This can be computed only multiplying the Vop vector by a constant 
factor. 



5 Results of the Simulation: The Same Gain for Both Inputs 



The cases will be analyzed for 3 different values of k, k = 0, 0.5, 1, showing the ef- 
fect in the membership functions of the fuzzy variables, the control surface and the 
graph result of the simulation. For all the analyzed cases it will be used as input a 
step function with amplitude 40, the parameters that allow evaluating the quality 
of the tuning are the settling time (considered to 98% of the value of the answer in 
stationary state), the overshoots and the oscillations. Also for all the analyzed 
cases the controller's structure is fixed, that means, the fuzzy associative memory 
is the same in all the examples. In the simulations it is included (on-line dotted) 
the answer of the system without controller to compare with the response using 
different tunings of the controller. The controller's fuzzy variables and the 
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membership functions for the initial conditions are shown in the figures 6, 7 and 8: 
error, change of error and control action respectively. 

5.1 Case 1: Adjusting the Membership Functions with a Tuning Factor k = 

The function r(lc) takes the value r(0) = 1/40. With the tuning process the vectors 
of operation points for the fuzzy input variables are the following ones: 

Vop error fhml = [-59.3947,-58.3743, 0, 58.3473, 59.3947] 
Vop d I dt{error) flnal = [-19.7982,-19.4581, 0, 19.4581, 19.7982] 





n a 40 




control action 



contra! surface 



Fig. 13. Membership functions of the fuzzy variables and control surface for k = 

Making the simulation with the controller's characteristics shown in the figure 13 
the following answer was obtained: 
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Fig. 14. Answer of the system for the case 1 with k = 



In the figure 14 it is shown that with the tuning factor k = the controller's ef- 
fect on the answer of the system, due to the tuning, is small, approaching to the 
answer without controller. In this case the settling time is the biggest that can be 
obtained, t, = 4.96 s. 
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5.2 Case 2: Adjusting the Membership Functions with a Tuning Factor k = 0.5 

The function r(k) takes the value r(0.5) = 1. With the tuning process the vectors of 
operation points for the fuzzy input variables are the following ones: 

Vop error final = [- 40, - 20, 0, 20, 40] 

Vop d I dt{error) flnal = [-13.332, -6.666, 0, 6.666, 13.332] 




control surface 



Fig. 15. Membership functions of the fuzzy variables and control surface for k = 0.5 

Computing the simulation with the controller's characteristics shown in the 
figure 15 the following answer was obtained: 
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Fig. 16. Output of the system for the case 2 with k = 0.5 



This case, with the tuning factor k = 0.5, is equal to operate with the initial con- 
ditions of the membership functions. The settling time is t s = 3.36 s. 

5.3 Case 3: Adjusting the Membership Functions with a Tuning Factor k = 1 

The function r(k) takes the value r{\) = 3. With the tuning process the vectors of 
operation points for the fuzzy input variables are the following ones: 
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Vop error fmal = [- 17.7724, - 2.2215, 0, 2.2215, 17.7724] 
Vop d I dt(error) fmal = [-5.9241,-0.7405, 0, 0.7405, 5.924 1] 

Computing the simulation with the controller's characteristics shown in the figure 17 
the answer was obtained in figure 18, where the settling time is t s = 1.6 s and it is the 
less value that can be obtained. 




. ■ 



Fig. 17. Membership functions of the fuzzy variables and control surface for k = 1 

The controller's effect on the answer of the system has begun to cause a small 
overshoot, due the bigger compression of the membership functions. If the value 
of r(k) is increased, the settling time is not reduced and it only causes bigger over- 
shoots and oscillations around the reference. 

To visualize the effect of different values of the tuning factor k over the settling 
time of the answer of the system simulations with increments Ak = 0.05 in the 
interval [0, lj were computed. The result is shown in figure 18. 
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Fig. 18. Settling time versus tuning factor k 
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Fig. 19. Comparative graph of the system answers for different values of k 

Figure 19 shows the behavior of the output of the system for incre- 
ments Ak = 0.1. The next curve to the left is the corresponding for a tuning fac- 
tor k = 1, with a settling time t s = 1.6 s. For each increment, beginning in k = 0, a 
curve was plotted showing in the figure a smaller settling time. 

Making use of the simulations, and the graphs in figures 19 and 20, it is possi- 
ble to see that the optimal value of A: for the tuning is k = 0.9. This value generates 
a settling time t„ = 1.6 s without a great overshoot and without oscillations 
(fig. 21). 

Additionally, the fuzzy controller's performance was compared with a control- 
ler PID (Proportional-integral-derivative) whose parameters are the following ones 
K p = 25, Tj = 1.35 and Tj = 5, and being that the differences are minimum as for 
time of establishment and general behavior (figures 20 and 21). 

The disadvantage found in the controller PID is its inefficiency in comparison 
with the fuzzy controller since the control action generated by the PID can take 
very big values that are impossible to consider in a real implementation. On the 
other hand, the fuzzy controller uses real range of values. 
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Fig. 21. Comparative graph of fuzzy controller answer versus PID 

Considering that this is the fastest output that can be gotten with the controller 
PID, limited to the nature of the system, that is to say, limiting the range of the val- 
ues that can take the control action to same values that those considered in the fuzzy 
controller's definition, it can be said that the tuning made on the fuzzy controller is 
satisfactory since it allows to vary the time of answer with very good behavior in the 
whole range of the tuning factor. Note that it is not evident to find three parameters 
of the PID for the optimal tuning and in the case of the fuzzy controllers it is neces- 
sary to increase or decrease the parameter k depending the settling time desired. 

5.4 Simulation Results: Different Gains for Every Input 

In this case a different gain k was considered for every input. The methodology 
used is exactly the same of the previous sections. The next figures show the stabi- 
lization time, the gain for the error kj and the gain used for the derivative of the er- 
ror Abusing aAk of 0.01. For the axes k } and k 2 the value of 21 is equivalent to 1 in 
the figures 22, 24 and 25. Fig. 19 can be obtained for the case of kj=k 2 in this sur- 
face (see fig. 23). 




Fig. 22. Settling time versus tuning factor kj and k 2 
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Fig. 23. Settling time versus tuning factor k] = k 2 




Fig. 24. View for tuning factor k 2 
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Fig. 25. View for tuning factor k t 

With this option the controller has other degree of freedom to control the re- 
sponse of the dynamic system. Fig. 25 shows how the effect of tuning factor k 2 , is 
less important than factor kj. Keeping the factor ki=l, the response of the system 
is in the interval 1.8-2.0 sees. 



6 Conclusions 

The tuning methods of fuzzy controllers include the handling of a great quantity of 
variables that makes very difficult, and many times non satisfactory the search of 
structures and good parameters. The method proposed uses only one variable and 
operates considering the transfer characteristic, or in this case the control surface 
that is the fuzzy controller's property that defines their behavior allowing that the 
system can response with bigger or smaller speed and precision. 

The function r(k) can be generalized to any system that uses a fuzzy controller 
varying the values r(0) and r(l) as well as the coefficients of the function r(k) de- 
pending on the desired behavior. 

Another perspective is to create a self tuning algorithm that modifies by itself 
the factors kj and k 2 to find the desired response. In this point, the use of fuzzy 
controllers presents attractive aspects for its implementation in real systems. 
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Abstract. Wireless sensor networks (WSN) are an emerging technology based on the 
progress of electrical and mechanical engineering, as well as computer science in the last 
decade. In this work, a FIS (Fuzzy Inference System) approach is used to increase the 
energy efficiency of a preamble sampling MAC (Media Access Control) protocol for WSN. 
The FIS replaces a traditional average approach reducing the number of sampling points 
and the processing time required by the traditional approach. In addition, the FIS is 
designed to have a small memory footprint and fast response; this considering that the FIS 
must be implemented in WSN nodes that use microcontroller that are limited in memory 
and processing speed. The result of using the FIS approach shows up to 90% reduction 
(best case scenario) on the waste of energy consumed by overhearing having an increment 
of energy efficiency. 

Keywords: Fuzzy approach, Wireless, Sensor, Network, microcontroller. 

1 Introduction 

Advances in digital electronics, micro-electro-mechanical systems and wireless 
communications have made possible the development of low power sensing 
devices with short range wireless communication capability at low cost. These 
sensing devices can autonomously collect, process and communicate information 
about their environmental conditions (i.e. temperature, vibration, sound and 
pressure). When a large number of sensor devices collaborate using their wireless 
links with a flow of information from each other they constitute a wireless sensor 
network [1]. 
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In WSNs the sensing devices are called sensor nodes and they operate in a 
self-organized and decentralized manner that maintains the best connectivity as 
long as possible and communicate messages to other nodes via multi-hop 
forwarding (see Fig. 1). 



^^^ Sensor Node 

Fig. 1. Core Components of a WSN: set of distributed sensors (1), interconnection network 
(2), data sink (3) and resources to control, configure and manage the network (4) 

Some early WSN applications were developed by the Defense Advanced 
Research Projects Agency (DARPA) in the U.S. for monitoring possible battle 
scenarios [2], but at present the scientific community has found an increased use 
in civilian applications [1]. WSNs have a wide range of applications due to their 
capabilities for detailed physical monitoring and manipulation offering enormous 
opportunities for almost every scientific discipline. Therefore, the research on this 
particular type of networks has increased in recent years, identifying opportunities 
and challenges for distributed signal processing in sensor devices and architectural 
challenges presented by these systems that are massively distributed, wirelessly 
connected and energy limited [1], [3], [4]. The state of most commercial 
applications is still immature, and there are a few companies that have platforms 
in the market at high cost [5] and [6]. However, given that WSNs are still in 
research stage they offer great opportunities for multidisciplinary science, 
technology, engineering, and mathematics education. 

2 LiSANDRA: An Experimental Wireless Sensor Network 

The University of Baja California (UABC), together with the Centre for Scientific 
Research and Higher Education of Ensenada (CICESE), have perceived as an 
opportunity the development, application, use, and diffusion of this technology. 
Therefore, projects in the research and development of this technology have given 
as results the designing and implementing of own WSN platform called 
LiSANDRA {Lightweight Sensor and Actuator Network with Distance Range 
Auto-adjustable) [7]. This network is an experimental WSN developed in order to 
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create the background and expertise and so assist the teaching-learning process for 
training students, providing them with the appropriate skills and knowledge on 
this technology. 

2.1 Architecture 

LiSANDRA has a multilevel hierarchical architecture, commonly called tiered 
architecture where the sensor nodes form a hierarchy in which a node at a given 
level performs a specific set of tasks on behalf of a subset of nodes in the level 
below. This in contrast to typically used WSNs called flat architecture, which has 
a single level where all sensor nodes are equal and are homogeneous in form and 
function. The reason to select hierarchical architecture is a cost-effectiveness issue 
due to this architecture reduces the cost of sensor nodes by allocating resources 
just where they can be most effectively utilized. These types of architecture and 
others are described widely in [8]. 



WiFi Network 
802.11 b/g 



j # Node-S (sensor) 

I *v (((<*■ Node-SR (sensor & router) 

1 ((((•# Node-G (gateway) 



Fig. 2. Architecture used by LiSANDRA 



As is shown in Fig. 2 the lowest layer in LISANDRA is basically composed by 
the node called sensor node (node-S) and they can only sense and transmit small 
data packets. These nodes can establish communication with just one sensor and 
router node (node-SR) which was previously associated to form a group of sensors 
called sensor cluster. The sensor and router nodes form the middle layer; this node 
type recognizes and interacts with the same type of nodes located in their 
neighborhood to form the primary mechanism for connectivity of the network 
(backbone). The network has another node at the highest layer of architecture 
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called node-G which operates as a gateway for connectivity to an 802.11b/g 
wireless network commonly called Wi-Fi network. This is a high cost node and 
for low cost applications it can set aside and make use of one node-SR as a 
gateway; this can be done by connecting its serial port to a personal computer 
(PC) that executes specific software for connectivity to the traditional wired or 
wireless networks. 

2.2 Nodes Organization 

Essentially the nodes are an embedded system composed of hardware and 
software sections. The hardware section is made up of four basic components 
which are: a sensing unit, a processing unit, a communication unit and a power 
unit. On other hand, the software section is embedded monolithic software called 
firmware composed of software subsystems that manage the hardware units. The 
three type nodes that compose LiSANDRA have similar organization, but this 
document is focused to the nodes-SR because these nodes are the most important 
due to they create the primary mechanism for connectivity of the network. For 
complete information about the all nodes see [9]. 
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Fig. 3. Architecture blocks diagram and pictures of node-SR 



2.2.1 Hardware 

The picture and architecture block diagram of the node-SR is shown in Fig 3. This 
node is built on a 1.25" x 2.50" double side PCB and uses two rechargeable AA 
batteries as power unit. The processing unit uses the ATmega644P [10] 
microcontroller that has 64KB instruction flash memory and 4KB of data memory 
(RAM) and operates at 2 million instruction per second (MIPS) it means haft 
micro-second per instruction. In addition, this section uses an external 32 KHz 
watch crystal as the time base for the RTC (Real Time Clock) and 1MB external 
serial memory flash that works as data logging system. 

The sensing unit is composed of a SHT11 temperature-humidity sensor [11] 
and a TSL2550T ambient light sensor [12]. However, the node hardware design 
considers microcontroller I/O lines as an expansion port that can be used to 
interface actuators and other types of sensors according to particular application 
requirements. 
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The communication unit includes the RF transceiver module TRM-418-LT [13] 
with an antenna embedded into the PCB. Also, there is a wired communication 
section based other the standard RS232 over TTL voltage, and it is use to 
communicate with the PC to download program and/or upload data stored in the 
data logging system. 

2.2.2 Software 

The main functions of node-SR are sense and routing data packets from/to other 
nodes of the same type. In addition this node has the ability to recognize up to 16 
nodes-S that were previously associated to form a cluster sensor. These 
subsystems are the application, communication processing and drivers 
(communication and sensor); all of them shielded from the low-level functionality 
of the components and running by an executive main loop technique. Fig. 4a 
shows the firmware flow chart representing the big tasks activities that the node- 
SR is performing continuously. 
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Fig. 4. Firmware flow charts of node-SR (a) and duty cycle sequence (b) 



The firmware of node-SR is implemented in 90% C and 10% assembly 
language using AVR Studio [10] and WinAVR [14] as development tools. It is 
structured in 3 layers; the first one is the application layer where the user programs 
the access to the network using a simplified network abstraction similar to the 
Berkeley sockets [15] that is the "de facto" standard for network programming. A 
middle layer called network layer is used to route data packets to others node-SR 
of the network. Finally, a physical layer focused on the functionality of the RF 
receiving-transmitting process of raw data transceiver; this in order to execute the 
MAC using a Carrier Sense Multiple Access (CSMA) scheme; so before a 
transmission is attempted, a node that has a data packet to transmit first "listens" 
to the channel to determine if it is busy. If the RF channel is busy, the transmitting 
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node "backs-off" for a random period of time after which it senses the channel 
again. In other hand, for the reception process the node-SR periodically goes into 
the sleep mode during which the radio is turned off. Thus, each node sets a wake- 
up timer and goes to sleep for a short specific period of time. At the expiration of 
the timer, the node-SR wakes up and listens to determine if it needs to receive or 
retransmit data packets to other node-SR. The complete listen-and-sleep cycle is 
referred to as a frame or duty cycle and it is performed every Ts (125mS) that 
corresponds to 10 times per second (see Fig. 4b). 

3 Preamble Sampling MAC Protocol 

Nowadays, one of the most important requirements of the WSN applications is the 
low power consumption, so the nodes require lowering their energy in order to 
support an application for longer periods of time. The protocols type LPL (Low- 
power-listening) form a set of MAC protocols that reduce the idle listening 
concerns, a state of the node when its radio is turned on in receive mode, but not 
receiving any packets. In a LPL protocol, nodes verify the channel every specific 
period of time, and if they do not receive any data during this verification, then 
they return to sleep as was described before (section 2.2.2). 

The protocols as Aloha with preamble sampling (PS) [16], WiseMAC [17], and 
B-MAC [18] were the first LPL proposed protocols. These protocols send data 
packets using very long preamble to guarantee that the intended receiver will stay 
on at the moment the channel is verified (see Fig 5a — The preamble is commonly 
coded as '10101010' for the purpose of leading the data sheer in the receiving end 
to a stable state). However, the protocols are not adapted to recent radios like the 
IEEE 802.15.4 [19] compliant Chipcon CC2420 [20] radio. Therefore, researchers 
introduced new compatible LPL protocols such as X-MAC [21], SpeckMac-D [22], 
and MX-MAC [23]. These new protocols are based on repeating either the data 
packet itself or a special advertisement packet, instead of the long preambles (see 
Fig. 5b). 
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Fig. 5. MAC: using long preamble (a), and using a special advertisement packets (b) 
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These kinds of MAC protocols try to reduce the waste energy due to the 
overhearing produced by the use of the long preamble technic. The overhearing is 
an effect of always listening for incoming traffic, is that a node will receive all 
messages including those that concern its neighbors only. Overhearing these 
messages is simply a waste of energy, and becomes problematic in dense networks 
with many nodes inside the reception range of a node. In this scenario the nodes 
detect the long preamble and they stay wake-up until the packet is received, 
analyzed and detected that the packet is for some neighbor node. Therefore the 
nodes wasted energy during the hearing process of long preamble to be waiting for 
the Start Of Packet (SOP). 

The nodes in LiSANDRA use the MP-MAC [7] (Modulated Preamble MAC) 
protocol type LPL (Low-power-listening) that uses the long preamble approach to 
avoid idle listening concerns. However, MP-MAC introduces a novel 
characteristic that is a mechanism to modulate the long preamble in order to 
minimize the overhearing problem. Thus, the transmitter node applies a 
modulation to the long preamble by changing the period length of the square 
wave; starting with long period length and reducing it proportionally until the SOP 
appears as is shown in Fig. 6. In consequence, the receiver nodes that wake-up are 
able to detect the long preamble and measure the period length of the square wave 
(w); and so to know the time in which the packet will start (SOP). Therefore, the 
receiver nodes knowing the period value (w) can make a decision to wait for the 
SOP if the period is short; or in other case return to sleep mode during the time 
given by the period value, and then to wake-up some time before the SOP appears. 
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Fig. 6. Modulated preamble MAC protocol used in LiSANDRA 



In order to achieve properly this method and get acceptable energy saving 
results is important to take the right measurements and quantity of readings on the 
square wave period length. This means that taking just one measure is not enough 
to make an acceptable decision due to inaccurate measure reading process. In 
other hand, taking high quantity reading and calculate the average will fallout in 
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overhearing problems again. Thus, the modulated preamble MAC protocol used in 
LiSANDRA requires a method that can be able to make a logical decision using 
very low quantity and imprecise data readings. However, the method should be 
supported with certain degree of reasoning based on facts, human knowledge and 
experience. This is the reason for use a fuzzy logic (FL) approach which is 
described in the next section. 



4 Fuzzy Logic Approach 

Since Lotfi Zadeh published his propose about "fuzzy set theory" [24], the 
progresses in number and variety of applications of the fuzzy logic have been 
increasing in the last two decades and supported by advances in the electronics 
and computing fields. The applications range from consumer products to 
industrial process control, instrumentation and decision-support systems. One of 
the successes of the fuzzy logic is due to the FL may be viewed as a methodology 
for computing with words rather than numbers, being the words more imprecise 
than numbers, and their use is much closer to human intuition and so including the 
tolerance for imprecision. In addition, the fuzzy logic has been used in inference 
systems to imitate the human behavior expertise on a particular field or scenario; 
and so to make decisions to accomplish a specific task inferring actions based on 
data, facts and rules. These kinds of systems are called Fuzzy Inference Systems 
(FIS), and they are the process of formulating the mapping from a given input to 
an output using fuzzy logic. The mapping then provides a basis from which 
decisions can be made, or patterns discerned. 
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Fig. 7. Structure of the FIS used in the nodes. 



There are two most common types of FIS, one of these is the Mamdani-type 
proposed in 1975 by Ebrahim Mamdani [25] as an attempt to control a steam 
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engine and boiler combination by synthesizing a set of linguistic control rules 
obtained from experienced human operators. In other hand is the Takagi-Sugeno- 
Kang [26] (most commonly called Sugeno) type introduced in 1985 and is similar 
to the Mamdani in some parts. The first two parts which are the fuzzifying the 
inputs and applying the fuzzy operator (rules) are exactly the same; and they differ 
in the way their outputs are determined. The main difference between Mamdani 
and Sugeno is that the Sugeno output membership functions (MF) are either linear 
or constant. These two types of fuzzy inference systems are describes widely in 
[25], [26] and [27]. 

4.1 Fuzzy Inference System Used in LiSANDRA 

The FIS structure used in the nodes is shown in Fig. 7, and it is a Sugeno type FIS 
[26] with three inputs and one output. The inputs (x\, x 2 and x 3 ) are the values of 
three consecutive period length measures of preamble; and the output (y) is the 
time period during the node could sleep before the SOP appears. The total 
preamble has been divided in 10 equal length categories to create 10 MF which 
are used by the fuzzifier section of the FIS; so the fuzzifier converts crisp inputs 
data (jci, x 2 and x 3 ) into fuzzy data (membership value) that can then be used by the 
if-then rules section. The FIS has an if-then rule section composed by rigorously 
1000 rules due to there are 3 input with 10 MF per input (rules = 10 3 ). However, 
just 9 rules are used at the time by the decision process, this according to the 
following logic facts. 

a) The period of the long preamble is modulated starting with a long period 
length and reducing it to a short length. 

b) Each category (member functions) of the modulated preamble is 
presented by the transmitter node during enough time to accomplish three 
period lengths measures by the receiver nodes. 

c) As effect of a) consecutive period length measures given as result that 
last measure must to be a length equal or shorter than previous ones. 

d) Based in b) and c), the three consecutive measures must be belonging to 
two adjacent member functions as maximum. 

Table 1. If-then rules for twogeneric adjcent member functions A and B 



Rule 




X\ is 


Inputs 

x 2 is x 3 is 




Output 


1 


IF 


A 


A A 


THEN 


y\ = i*i + 3*2 + 4*3 


2 


A 


A B 


y 2 = 2x 1 + 4x 2 + 2x 3 


3 


A 


B A 


y-i = 2*i + 0x 2 + 6x 3 


4 


A 


B B 


y 4 = 0x 1 + 2*2 + 6x 3 


5 


B 


A A 


y 5 = 0*! + 2x 2 + 6x 3 


6 


B 


A B 


y 6 = 2*! + 0*2 + 6x 3 


7 


B 


B A 


y 7 = 2*! + 3*2 + 3x 3 


8 


B 


B B 


y 8 = 1*! + 3*2 + 4x 3 


9 






OTHERWISE 




ERROR 



134 



L. Aguilar et al. 



Therefore, the decision process uses the three inputs and two contiguous 
member functions of the total teen. This gives us a system with 3 inputs with 2 
member function per input resulting in 8 rules. There is an extra rule that to 
represent the otherwise statement (see Table 1) which are the set of rules not fired, 
in other words the complement of fired rules set. The defuzzifier section is 
composed by a set of 8 linear functions (one per rule [yi to y 8 ]), where each 
function depends of the inputs measures (x b x 2 and x 3 ). So each one affects 
proportionally each output function according to its significance in the rule (see 
Table 1 output column). Since each rule has a crisp output, the overall output is 
obtained via weighted sum operator instead of weighted average to reduce 
computational further in the process of defuzzification. 

4.2 Implementation of FIS 

As mentioned in section 2, the node-S has hardware limitations such as low 
performance and processing speed to name some ones. Thus, the limitations of the 
nodes are an important issue has to be considered in the implementation of FIS. 

4.2.1 Member Functions 

Considering that the microcontroller used in the node does not have a math 
coprocessor or float-point unit, the type MF to be selected should be the simplest 
ones which do not required a high processing algorithm and integer arithmetic is 
enough. This limits the selections to triangular and trapezoidal type [27]; however 
keeping in mind the fact b) described in last section: "each category of the 
modulated preamble is presented by the transmitter node during enough time to 
accomplish three period lengths measures by the receiver nodes". The most 
appropriate MF in this case is a symmetrical trapezoid type which is shown in 
Fig. 8 and defined by the equation 1. 
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Fig. 8. Two adjacent MFs type trapezoidal 
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trapezoid(x; a, b,c, d) = < 



, 

x—a 
b-a' 

1 , 
d-x 
d-c' 

, 



x < a 
a < x < b 

b < x < c 
c < x < d 
d < x 



(1) 



The microcontroller does have hardware 2-cylce multiplier instruction (MUL) and 
fractional multiplier instruction (FMUL) [10]; so to implanting or use standard 
math library introduce a computational overhead. In order to avoid that overhead 
and to achieve a compact and fast implementation, the trapezoid MF could be used 
scaling the functions to divisor (b-a) [or (d-c) because is a symmetrical] and so 
eliminating the division operation allowing to use a fast integer arithmetic 
operations; but keeping in mind the value is scaled. 

Thus, using this approach the symmetrical trapezoidal MFs are scaled to 8 
because is easy to remove it just using the shift-to-right (») instruction three 
times, it since this operation is equivalent to divide by two. In Fig. 9 are shown the 
first three MF that compose the 10 MF of the fuzzifier section. 




48 56 



72 80 



Fig. 9. Scaled trapezoidal member functions 



The implementation to calculate the membership value for this MF is sown in 
the List 1 as a C language trapMF function. This function has a small size 
memory footprint (32 native instructions only) and is implemented to calculate the 
membership grade given a value (x) and the number (0...9) of the 10 MFs to be 
used (numFM). In best case scenario (returning by the first if instruction) the 
microcontroller spent lluS to calculate and return the value and 18uS for worst 
case scenario (returning by the last else instruction). 
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List 1. Trapezoidal member function implemented in C language. 
UINT8 trapMF ( UINT8 x , UINT8 numMF ) { 

UINT8 a=0,b=8,c=2 4,d=32,delta=2 4; 



delta *= numMF; 
a += delta; 
b += delta; 
c += delta; 
d += delta; 



/* delta according to # of MF */ 
/* adjust parameter a,b,c and d */ 
/* according to # of MF */ 



if ( x <= a ) return 0; 
if ( x > a && x <= b ) return x-a; 
if(x>b&&x<=c ) return 8; 
if ( x > c && x <= d ) return d-x; 
else return ; 



4.2.2 FIS Processing 

The Fuzzy Inference System used in this work uses the process algorithm showed 
in the List 2. This starts taking three period length measures values (xl, x2, and 
x3) which are verified to check for error reading. The three values should be 
belonging to two consecutive MFs; if this is not complied with, the process starts 
again. In this verification the two adjacent MFs are identified (A and B) and now 
all data are ready to fuzzifier inputs and apply the If-then rule process. Thus 
the firing strength (w) of each rule is calculated using the minimum as AND 
operator. Then the linear function output of each rule is calculated using the values 
(xl, x2 and x3) and their corresponding parameters (p, q and r). Finally, the 
overall output (Y) is calculated using weighted sum; this is accomplished 
accumulating all results of multiply the linear function output of each rule by its 
corresponding firing strength. 
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List 2. FIS process algorithm. 

1. Take 3 consecutive measures values (xl , x2 , x3 ) 

2. If error goto step 1 /*values aren't in 2 adjacent 
MF*/ 

3 . Determine the two involved MF (A and B) 

4. Apply rules to determinate firing strength: 



w[0] = min 

w[l] = min 

w[2] = min 

w[3] = min 

w[4] = min 

w[5] = min 

w[6] = min 

w[7] = min 



mf_A(xl) 
mf_A(xl) 
mf_A(xl) 
mf_A(xl) 
mf_B(xl) 
mf_B(xl) 
mf_B(xl) 
mf_B(xl) 



mf_A(x2) 
mf_A(x2) 
mf_B(x2) 
mf_B(x2) 
mf_A(x2) 
mf_A(x2) 
mf_B(x2) 
mf_B(x2) 



mf_A(x3) 
mf_B(x3) 
mf_A(x3) 
mf_B(x3) 
mf_A(x3) 
mf_B(x3) 
mf_A(x3) 
mf_B(x3) 



Calculate linear functions outputs: 
y[n] = p[n]*xl + q[n]*x2 + r[n]*x3; n= [ . . 7 ] 
Calculate overall Y output using weighted sum: 
Y = £ w[n] *y[n] ; n=[0. .7] 



In order to achieve a small size memory program implementation and provide an 
acceptable computational performance; the algorithm (List 1) described before has 
been slightly modified and so to reduce its execution time and memory used. The 
modifications are described as follows: 

a) All membership values are calculated and stored in a small memory array 
(one per input) to be used at the moment if-then rules are applied and so 
avoid unnecessary recalculations. 

b) There is only one memory location for the firing strength (w) and other 
for linear function outputs (y) that are reused in each rule calculation. 

c) The weighted sum (Y) is calculated iteratively just after the firing 
strength (w) and linear function outputs (y) are calculated for each rule. 

As result of these modifications the final code of the principal engine for the FIS 
is showed in List 3. This code uses just 128 native instructions that correspond to 
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256 bytes of program memory giving us a very small memory footprint as result. 
In addition, this code presents an acceptable performance having an execution 
time of around 700uS to complete the entire process having the input values (xl, 
x2, and x3) to calculate the overall output (Y). It is important to remark that the w 
and the parameter p, q, s are scaled, so the linear function output (y) and the 
overall output (Y) are divided by 8 using the instruction shift-to-right three times. 

List 3. FIS process implemented in C language. 

n=0; /* used as index */ 

Y = 0; /* overall FIS output */ 

B = A + 1 ; /* member functions to be used in FIS */ 

/* calculate the membership values of xl,x2 and x3 */ 
/* for each member function - A and B */ 

mfxl [0] =trapMF(xl,A) ; mfxl [1] =trapMF (xl, B) ; 
mfx2 [0] =trapMF(x2,A) ; mfx2 [1] =trapMF (x2 , B) ; 
mfx3 [0] =trapMF(x3,A) ; mfx3 [1] =trapMF (x3 , B) ; 

/* apply rules and calculate for each rule: */ 
/* a) firing strength */ 

/* b) linear output function */ 

/* c) and accumulate it to the overall output */ 
for( i=A; i<B; i++ ) 
{ 

for( j=A; j<B; j++ ) 

{ 

for ( k=A; k<B; k++ ) 

{ 

/* AND operator to get firing strength */ 
w = min ( mfxl[i], mfx2[j], mfx3 [k] ) ; 

/* linear output function using p q and r */ 
y = ( p[n]*xl + q[n]*x2 + r[n]*x3 ) » 3 ; 
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/* next index for parameters p q and r */ 
n++; 

/* overall output ==> weighted sum */ 
Y = Y + ( ( w*y ) » 3 ) ; 



} 

} /* output Y is ready */ 

5 Experimental Test and Results 

In order to verify and validate the FIS performance some experimental test have 
been realized, however the most representative test is presented in this work. This 
test consists of one node-SR transmitting four broadcast type packets per second 
to 10 nodes. The 10 nodes are working with the MP-MAC protocol using the FIS 
and the average method. The simple average method is applied to the three inputs 
measures (xl, x2 and x3) to calculate the time-to-sleep. The plan to use both 
methods in each node is to make a comparison between them under same 
conditions, so determine the benefit of using the FIS versus a traditional approach. 
In addition, the uses of 10 nodes in the test give us a mechanism to detect 
abnormal behaviors on nodes; it because the same behavior is expected from all of 
them. A simple schema of the test configuration is shown in Fig. 10 and a picture 
of the test bench used during the test period as well. 



Node 1 Node 2 Node 3 



Node 9 Node 10 




Fig. 10. Test configuration for nodes and test-bench 
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The test starts with the transmitter node that sends an initial broadcast packet 
to synchronize and notify to receiver nodes that the test has started. Then, the 
transmitter node sends 4 packets per second which are sequentially labeled during 
5 minutes having a total of 1200 packets. The nodes do not use the calculated 
time-to-sleep value to sleep and save energy, instead they stay wake-up to receive 
the packets to avoid missing packets because both methods are in evaluations and 
they could carry out wrong calculations. The receiver nodes store the following 
important set of data for each received packet to be retrieved later for further 
analysis. 

1) The three period length measures used in the methods (FIS or simple 
average). 

2) The calculated time-to-sleep determinate by each method. 

3) Number of tries used to get acceptable value (average approach is one) 

4) The label of received packet in order to determinate missing packets. 

As test result, all nodes presented highly similar stored information without any 
abnormal behavior on them. Thus, this gives us a valid and acceptable data to be 
analyzed. Fig. 11 shows part of data stored by a node and is presented as a 
comparative chart between the FIS and average methods. 
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Fig. 11. Data stored by a receiver node 

The following important interpretations are determinate based on the data 

analysis. 



1) In a quiet scenario where there is not RF noise in the environment, the 
three period length measures are close and stable; in other words the 
measures belong to the same member function both method are compatible 
and produce same results. However, is important to remark that the simple 
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average are faster than FIS method due to last one requires more complex 
calculations. 

2) If the three period length measures have discrepancies or they are 
dispersed, the simple average approach produces wrong output values. In 
other hand, the FIS could detect the problems and then take new measures 
to start again. This approach is able to detect that kind of problems because 
it has a certain human knowledge embedded into the rules section of the 
FIS. 

3) The nodes using the FIS present better performance than the average 
approach having around 90% average in saving energy; this means that the 
nodes sleep during 90% of the long preamble reducing significantly the 
overhearing problem. This performance is related to the best case scenario 
where the nodes are synchronized to the duty cycle of the transmitter node. 
In a non- synchronized test scenario the FIS lower than 90% but still better 
than simple average approach, it is basically because the FIS has the 
capability to detect possible errors and starts again (interpretation 2). 

6 Conclusions an Future Work 

In this work, a fuzzy inference system has been proposed, implemented and tested 
in nodes of a WSN which use a novel modulated preamble MAC protocol to 
reduce the overhearing problem. The parameter used in the FIS were determined 
by using human experience and based in facts detected in the system. 
Experimental test were realized to verify and validate the FIS performance and so 
to determine the computational cost and benefit of use the FIS compared to a 
traditional approach (simple average). The test result presents a good and 
acceptable performance that in all cases results better than traditional approach. 
However, the FIS has an implementation that uses more memory footprint and 
computational execution time; but they are not presenting a big impact due to the 
memory used is just 125 native instructions (250 bytes of program memory) with 
an execution average time of 700 micro seconds to complete FIS calculation 
process. As propose for future work, the FIS should be optimized adjusting its 
parameter applying the back-propagation algorithm off-line and using the stored 
data obtained in the experiments realized. 

References 

1. Akyildiz, I.F., Su, W., Sankarasubramaniam, Y., Cayirci, E.: Wireless Sensor 
Networks: a survey. Computer Networks Journal 38(4), 393-422 (2002) 

2. Sohoraby, K., Minoli, D., Znati, T.: Wireless sensor networks. Technology, protocols 
and applications. John Wiley & Sons, New Jersey (2007) 

3. Hac, A.: Wireless Sensor Network Designs. John Wiley & Sons, Honolulu (2003) 

4. Callaway Jr., E., Callaway, E.: Wireless Sensor Networks: Architectures & Protocols. 
Auerbach Publications, Florida (2003) 



142 L. Aguilar et al. 

5. Crossbow Technologies, Inc., http : //www.xbow. com 

6. Sentilla Corporation, http: //www. sentilla. com 

7. Aguilar, L.: LiSANDRA an Experimental Wireless Network, PhD thesis, Universidad 
Autonoma de Baja California (2009) 

8. Ilyas, M., Mahgoub, I.: Handbook of Sensor Networks: Compact Wireless and Wired 
Sensing Systems. CRC Press, Florida (2004) 

9. Aguilar, L., Licea, G., Garcia-Macias, J.A.: An Experimental Wireless Network 
applied to engineering courses. Computer Applications in Engineering Education. 
Wiley-InterScience, Hoboken (2009) 

10. Atmel Corporations, http : / /www. atmel . com 

11. Sensirion, http : //www. sensirion. com 

12. Texas Advanced Optoelectronic Solutions, Inc., http : / /www. taosinc . com 

13. Linx Technologies, Inc., http: //www. linxtechnologies . com 

14. WinAVR, http://winavr.sourceforge.net 

15. Danahoo, M., Calvert, K.: TCP/IP Sockets in C: Practical Guide for Programmers. 
Morgan Kaufmann Publishers, San Francisco (2001) 

16. El-Hoiydi, A.: Aloha with preamble sampling for sporadic traffic in ad hoc wireless 
sensor networks. In: Proc. IEEE Int. Conf. on Communications, ICC 2002 
(April 2002) 

17. El-Hoiydi, A., Decotignie, J.: WiseMAC: An ultra-low power MAC protocol for 
multi-hop wireless sensor networks. In: Proc. 1st Int. Workshop on Algorithmic 
Aspects of Wireless Sensor Networks (2004) 

18. Polastre, J., Hill, J., Culler, D.: Versatile low power media access for wireless sensor 
networks. In: Proc. SenSys 2004, pp. 95-107 (2004) 

19. IEEE Computer Society LAN MAN Standards Committee: Wireless medium access 
control (MAC) and physical layer (PHY) specifications for low rate wireless personal 
area networks (LR-WPANs). IEEE Std. 802.15 (2004) 

20. Chipcon, Products from Texas Instruments. CC2420 data sheet, 2.4 Ghz IEEE 
802. 15.4 / Zigbee-ready RF transceiver 

21. Buettner, M., Yee, G.V., Anderson, E., Han, R.: X-MAC: a short preamble mac 
protocol for duty-cycled wireless sensor networks. In: Proc. 2nd ACM Conf. on 
Embedded Networked Sensor Systems, SenSys 2006, pp. 307-320 (2006) 

22. Wong, K.-J., Arvind, D.: Speckmac: Low-power decentralized MAC protocol low data 
rate transmissions in specknets. In: Proc. 2nd IEEE Int. Workshop on Multi-hop Ad 
Hoc Networks: from Theory to Reality, REALMAN 2006 (May 2006) 

23. Merlin, C.J., Heinzelman, W.B.: Network-aware adaptation of MAC scheduling for 
wireless sensor networks. In: Aspnes, I., Scheideler, C, Arora, A., Madden, S. (eds.) 
DCOSS 2007 Poster Session. LNCS, vol. 4549. Springer, Heidelberg (2007) 

24. Zadeh, L.A.: Fuzzy sets. Information and Control 8, 338-353 (1965) 

25. Mamdani, E.H., Assilian, S.: An experiment in linguistic synthesis with a fuzzy logic 
controller. International Journal of Man-Machine Studies 7(1), 1-13 (1975) 

26. Sugeno, M.: Industrial applications of fuzzy control. Elsevier Science Pub. Co., 
Amsterdam (1985) 

27. Jang, J.S.R., Sun, C.T.: Neuro-Fuzzy and Soft Computing: A Computational Approach 
to Learning and Machine Intelligence. Prentice Hall, Englewood Cliffs (1997) 



Multi-Agent System Based on Psychological Models for 
Mobile Robots 

Arnulfo Alanis Garza, Oscar Castillo, and Jose Mario Garcia Valdez 

Division of Graduate Studies and Research, Instituto Tecnologico de Tijuana, Calzada 
Tecnologico, Tijuana, Mexico 

alanis@tectijuana. edu.mx, ocastillo@tecti juana.mx 



Abstract. We describe the design of Intelligent Agents and experimenting with 
learning environments in mobile robotics, using new technologies such as the 
Knowledge Query and Manipulation Language (KQML), and new features of the 
Jade programming language, for the creation of distributed applications such as 
Remote Method Invocation (RMI). A society of intelligent agents could solve 
these problems. These agents would be able to migrate from one node to another, 
the ability to communicate with other agents, using an expressive communication 
language, work together cooperatively to accomplish complex objectives for a 
user, act on their own initiative and use local information and knowledge to 
manage resources and requirements of other agents. Most existing software 
ignores the fact that generally the tasks performed by an individual are part of 
collective activities. There is individual work, is working with third-group 
contributions and consensus with others, is why the support systems to 
collaborative work are presented as a particular need for the development of future 
software and intelligent agents, as an essential component design and 
implementation. 



1 Introduction 

Making smarter robots is a problem that has captivated the scientific community 
for several years now, this being a big challenge to overcome even for man. The 
move from one place to another from an initial to a final point with only the 
information of what we want to reach at the end is a complicated task. The 
problem has been solved with soft computing methods, such as fuzzy logic, neural 
networks, genetic algorithms, hybrid systems, among others. Navigation of the 
robot has been achieved taking a completely controlled environment where we 
have total knowledge of the world [4,7, 11]. 

Human beings have the ability to react to unexpected situations and the ability 
to use their reasoning to react to situations that advise, an example is a situation of 
traffic management, to which we think and what is the best route and with this the 
rest of the trip is done reacting to what was observed. This could be, at a traffic 

O. Castillo et al. (Eds.): Soft Computing for Intell. Control and Mob. Robot., SCI 318, pp. 143 J159.I 
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light, when it changes from green to yellow, the reaction will be learning, 
acceleration or deceleration, a situation that when you start, it has to be learned. 

Knowing that we have this ability to react, we will use fuzzy logic to transfer 
the knowledge we use to carry out this process by establishing computer-type "if- 
then" fuzzy rules and exploiting the advantages offered to us by fuzzy logic, 
which is the use of linguistic variables. 

A man driving a car as in many other activities derives its information by 
looking at distances, clearances and other identifying information that we collect 
through the light. In this research we use computer vision techniques to draw from 
that vital information for navigation control, supported with special sensors. An 
example is the ultrasonic sensors that are used to measure distance and the light 
sensors that we can use to measure the change in light intensity, all of these help 
to extract environmental information necessary for decision making. 

2 Preliminary 

2.1 Agents 

Let's first deal with the notion of intelligent agents. These are generally defined as 
"software entities", which assist their users and act on their behalf. Agents make 
your life easier, save you time, and simplify the growing complexity of the world, 
acting like a personal secretary, assistant, or personal advisor, who learns what 
you like and can anticipate what you want or need. The principle of such 
intelligence is practically the same of human intelligence. Through a relation of 
collaboration-interaction with its user, the agent is able to learn from himself, from 
the external world and even from other agents, and consequently act 
autonomously from the user, adapt itself to the multiplicity of experiences and 
change its behavior according to them. The possibilities offered for humans, in a 
world whose complexity is growing exponentially, are enormous [12] [13]. 

We need to be careful to distinguish between rationality and omniscience. An 
omniscient agent knows the actual outcome of its actions, and can act accordingly; 
but omniscience is impossible in reality. Crossing the street was rational because 
most of the time the crossing would be successful, and there was no way I could 
have foreseen the falling door. Note that another agent that was equipped with 
radar for detecting falling doors or a steel cage strong enough to repel them would 
be more successful, but it would not be any more rational [17]. 

2.2 Agent Based Simulations 

As these non-linear, adaptive interactions are mostly too complex to be captured 
by analytical expressions, computer simulations are most often used. The basic 
idea of such simulations is to specify the rules of behavior of individual entities, as 
well as the rules of their interaction, to simulate a multitude of the individual 
entities using a computer model, and to explore the consequences of the specified 
individual-level rules on the level of population as a whole, using results of 
simulation runs. As the simulated entities are usually called agents, the 
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simulations of their behavior and interactions are known as agent-based 
simulations. The properties of individual agents describing their behavior and 
interactions are known as elementary properties, and the properties emerging on 
the higher, collective level are known as emergent properties [8]. 

2.3 Basic Properties of Agent-Based Models 

What makes agent-based models particularly appealing and interesting is that 
consequences on the collective level are often neither obvious, nor expectable, 
even in many cases when the assumptions on individual agent properties are very 
simple. Namely, the capability of generating complex and intriguing emergent 
properties arises. 

Not so much from the in-built rules of individual agent behavior, as from the 
complexity of the network of interactions among the agents. Precisely this 
multitude of agents, as well as the multitude and complexity of their interactions, 
are the main reasons why in most cases formal mathematical deduction of results 
of an agent-based model is not possible. 

This is also the reason why the issues of complexity remained relatively under- 
explored until recently. Namely, as scientists regularly decide to pay attention to 
"problems defined by the conceptual and instrumental techniques already at hand" 
[4], "some facts [...] are pushed to the periphery of scientific investigation, either 
because they are thought not to be relevant, or because their study would demand 
unavailable techniques". Accordingly, only after recent advances in the development 
of computational technology have enabled massive simulation experiments, the 
issues of emergent complexity came closer to the focus of scientific research. 

Besides the above mentioned modeling of bottom-up effects, i. e. the effects 
originating at the individual level and influencing the collective one, more 
complex agent-based models are also capable of modeling top-down effects, 
arising at the collective level and influencing the level of individual agents [14]. 

3 Agent Based Models in Social Sciences 

Although most well-established within the framework of natural sciences, the 
application of agent-based simulations within the field of social sciences since 199 
is also significantly growing. It must be emphasized that the primary purpose of 
agent-based modeling and simulations in social sciences is not prediction. 
Namely, social processes are usually so complex that their maximally faithful 
replicas would hardly ever be possible. The consequence is that agent-based 
models mostly do not possess the level of "accuracy "needed for a model to be 
used for predictive purpose[14,8,15]. 

4 What Is Fuzzy Logic? 

In this context, FL is a problem-solving control system methodology that 
lends itself to implementation in systems ranging from simple, small, embedded 
micro-controllers to large, networked, multi-channel PC or workstation-based data 
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acquisition and control systems. It can be implemented in hardware, software, or a 
combination of both. FL provides a simple way to arrive at a definite conclusion 
based upon vague, ambiguous, imprecise, noisy, or missing input information. 
FL's approach to control problems mimics how a person would make decisions, 
only much faster [6]. 

FL incorporates a simple, rule-based IF X AND Y THEN Z approach to a 
solving control problem rather than attempting to model a system mathematically. 
The FL model is empirically-based, relying on an operator's experience rather than 
their technical understanding of the system. For example, rather than dealing with 
temperature control in terms such as "SP =500F", "T <1000F", or "210C <TEMP 
<220C", terms like "IF (process is too cool) AND (process is getting colder) 
THEN (add heat to the process)" or "IF (process is too hot) AND (process is 
heating rapidly) THEN (cool the process quickly)" are used. These terms are 
imprecise and yet very descriptive of what must actually happen. Consider what 
you do in the shower if the temperature is too cold: you will make the water 
comfortable very quickly with little trouble. FL is capable of mimicking this type 
of behavior but at very high rate [18]. 

5 Big Five Personality Factors 

Why do we study personality? 

The NEO that you have just completed looks at 5 personality traits, known as the 
Big Five. We will briefly look at what traits are, how these personality factors 
were determined, what the traits mean, what the Big Five predict about our 
behavior, and how these factors might relate to motivation. 

What are traits? 

Traits are consistent patterns of thoughts, feelings, or actions that distinguish 
people from one another. Traits are basis tendencies that remain stable across the 
life span, but characteristic behavior can change considerably through adaptive 
processes. A trait is an internal characteristic that corresponds to an extreme 
position on a behavioral dimension. 

There have been different theoretical perspectives in the field of personality 
psychology over the years including human motivation, the whole person, and 
individual differences. The Big Five falls under the perspective of individual 
differences. 

How were these personality factors determined? 

The Big Five represents taxonomy (classification system) of traits that some 
personality psychologists suggest capture the essence of individual differences in 
personality. These traits were arrived at through factor analysis studies. Factor is a 
technique generally done with the use of computers to determine meaningful 
relationships and patterns in behavioral data. You begin with a large number of 
behavioral variables. The computer finds relationships or natural connections 
where variables are maximally correlated with one another and minimally 
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correlated with other variables and then groups the data accordingly. After this 
process has been done many times a pattern appears of relationships or certain 
factors that capture the essence of all of the data. Such a process was used to 
determine the Big Five Personality factors. Many researchers tested factors other 
than the Big Five and found the Big Five to be the only consistently reliable 
factors [6, 14, 16]. 

Strict trait personality psychologists go so far as to say our behavior is really 
determined by these internal traits, giving the situation a small role in determining 
behavior. In other words, these traits lead to an individual acting a certain way in a 
given situation. 

Allport, Norman and Cattell were influential in formulating this taxonomy 
which was later refined. Al port compiled a list of 4500 traits. Cattell reduced this 
list to 35 traits. Others continued to analyze these factors and found congruence 
with self- ratings, ratings by peers and ratings by psychological staff that 
eventually became the Big Five factors. 

The Big Five factors are: 

I - extraversion vs introversion 

II - agreeableness vs antagonism 

III - conscientiousness vs undirectedness 

IV - neuroticism vs emotional stability 

V - openness to experience vs not open to experience 

There was a need for an integrative framework for measuring these factors. The 
NEO Personality Inventory was created by Costa and McCrae and originally 
measured only neuroticism, extraversion and openness. The other factors were 
added later. There are other measures of the Big Five, such as the BFI (Big Five 
Inventory) and the TDA (Traits Descriptive Adjectives). The NEO has the highest 
validity of the Big Five measurement devices[9]. 

What do the five traits mean? 

Keep in mind that the traits fall on a continuum and this overhead shows 
characteristics associated with each of the traits. Looking at these characteristics 
we can formulate what each of the traits mean. 

• Openness - (inventive / curious vs. cautious / conservative). Appreciation 
for art, emotion, adventure, unusual ideas, curiosity, and variety of 
experience. 

• Conscientiousness - (efficient / organized vs. easy-going /careless). A 
tendency to show self-discipline, act dutifully, and aim for achievement; 
planned rather than spontaneous behavior. 

• Extroversion - (outgoing / energetic vs. shy / withdrawn). Energy, 
positive emotions, urgency, and the tendency to seek stimulation in the 
company of others. 

• Agreeableness - (friendly / compassionate vs. competitive / outspoken). 
A tendency to be compassionate and cooperative rather than suspicious 
and antagonistic towards others. 
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• Neuroticism - (sensitive / nervous vs. secure /confident). A tendency to 
experience unpleasant emotions easily, such as anger, anxiety, 
depression, or vulnerability [6, 14, 16]. 

6 Proposed Method 

We present a sample application of ODD to a population based model of a football 
game, for reasons of limited space we have chosen a relatively simple model that 
describes many of the processes empirically by using probabilities, for example, 
"defend the doorman". 

Extraversion, Agreeableness 

The purpose of the model is to understand how social Partre players' behavior, 
particularly territoriality. 

Conscientiousness 

The model comprises four hierarchical levels: individual territory of (Target) of 
the equipment and the environment. Individuals are characterized by state 
variables: identity number, age, sex, identity of the area where the individual and 
social life. The players have recently entered the process of additional state 
variable weight, which affects their behavior. The Complaint players who have 
not known his first game as minors, 1-game played, as initiates, and all others as 
completed. Apart from this, social rank is the main attribute which tells the 
difference between the complete key (experts) and subdominant (started). 

Openness, Neuroticism 

The product of the model in steps, the time of each match will be fingers or stages 
of processing modules in the following order: the individual, the expert, height, 
weight, changing players, changing positions. Within each module, the players 
and the field are processed in random order. A player in the life cycle is 
represented as a tree. 

6.1 Design Concepts 

Emergency: Dynamics of the population to leave the conduct of (the players), but 
the individual's life cycle and behavior are fully represented by empirical rules. 

For this to work, each player creates an SMA (robot), which consists of three 
agents, an agent for the management of the sensors, an agent that is responsible for 
controlling the engine and the last agent to act as coordinating multi-agent system 
[1,2,3]. 

Summarizing the SMA is composed of the following agents: 

Node Agent (AN), a set of sensors and personality factors. 
Task Agent (AT), a set of servo motors. 
System Agent (AS), coordinator of the system. 
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The Node Agent (NA) will be responsible for the sensors used by our robot that are: 

• Ultrasonic Sensor 

• Light Sensor left 

• Light Sensor right 

• Camera 

The set of sensors has great features associated to the personality, such as 
openness, conscientiousness, extroversion Agreeableness, Neuroticism. 

Node Agent through the sensors, receive their world and according to its 
environment and activity, has one of the five personality factors, which may 
change according to the perceptions of the world and the other elements. 

These sensors are used to interact with the world to achieve our goal. 

The Task Agent (TA) will be used to drive to move forward, rewind, or make 
some movement to escape. 

The System Agent (SA) will be responsible for coordinating the other actors, the 
AN and AT, and will also have activities not only the coordination, which will be to 
recognize the object that is opposite, calculate the angle of the object, which will help 
us to know that will guide the robot. Depending on the recognition that was obtained, 
whether or not the object, which will tell us that there is no escape or the object based 
on this decision was to trigger the switch which will control the outcome to be used, 
reactive or path to better performance of the controllers, fig 1 [5]. 
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Fig. 1. Architecture of the complete system 
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We can describe the proposed method as follows: 

In the operation of the model, we have 2 main blocks, which are responsible for 
knowledge and learning (paradigm of intelligent agents) and the vision and control 
(fuzzy logic). 

These modulus are described below, the first module contains 3 agents, NA 
[Node Agent], TA [Task Agent] SA [System Agent] Agent System (AS), and will 
know every time the operation of the other agents. 

To detect a change in the environment the Agent Node [that is in charge of the 
sensors (ultrasonic, camera and two light sensors)] with the ultrasonic sensor and 
two light sensors, starts its operation, and we start at the state called reactive 
control; this because you have to move and sense all the time until it finds an 
obstacle, this can happen once a photo is taken, which will be sent to the database 
of images (BDI), the image will be applied a pre-processing for recognition in 
order to know whether the object is found, this decision was taken on the angle 
(angle to take the decision to bypass the object), able to escape and move forward, 
trend data, they are caught in the trajectory control process, to observe the speed 
values. The Task Agent (which is responsible for the drive), once all the necessary 
parameters for the performance of the robot agent system (AS) who is the 
coordinator, and executes instructions in the robot. 

For the development of the Multi- Agent System we used Gaia [19] and UML 
for the analysis and design of the agents. The completion of the Multi-Agent 
System (MAS) is based on the FIPA [20] standards for better utilization and 
greater possibility of extension [18]. 

6.2 Gaia 

Below we show the diagrams in which we can observe the responsibilities, 
permissions, activities and protocols to be followed by each agent in the Multi- 
Agent System (MAS). 

Agent node 

Responsibilities: 

Sense: 

• Active 

• Inactive 

• And each of the Big Five Personality factors, and depending on which 
factor is the one needing the robot will have one or more factors, and this 
will give her unique personality (openness, conscientiousness, 
extroversion Agreeableness, Neuroticism). 

Permissions: 

Ultrasonic sensor: measures distances from cm to 255 cm, with a delay 
of one millisecond signal. 

Camera Sensor: take pictures in an estimated time of seconds, saves all 
images in a folder for processing. It has a degree of vision approximately of 
50 degrees. 
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Light sensors: they measure the intensity with which an object reflects light. 
This makes a light emitting and measuring the portion of the return is received. 

Activities: 

The only activities are sensing and taking pictures of the world. 

Protocols: 

Ultrasound: This is responsible for sensing the distance to get to an 
obstacle. The initiative for this would be the agent system, which indicates 
the time of initiation. 

Chamber is responsible for obtaining the images within the scene, and as 
for the ultrasonic sensor system depends for its initiation. 

Light sensors: the role of these sensors is to measure the intensity with 
which an object reflects light (walls, diagrams). This makes a light emitting 
and measuring the portion of the return is received and handled the ranges 
are to 1023 RAW. 

Agent system 

Responsibilities: 

It is responsible for image processing and decision making, and sends a 
message to agent communication process completed? 

Permissions: 

This is the one that has full access, sensors, communication, engine and 
handling agents and NXT in general. 

Activities: 

Making decisions based on the behavior and implements what is needed to 
finish the job. 

Protocols: 

Full Access, sensors, communication, engine and handling agents and 
NXT in general, the entry for this information would be provided by the 
agents, getting a response from these so they generate an output and generate 
interaction among them. 

Task agent 

Responsibilities: 

Motor movement 
Permission: 

Depends entirely on the agent system to perform its task. 

Activities: 

Sensors for the engine: they measure in degrees is given for each engine is 
360 degrees for one revolution, a revolution for the tire of each motor, the 
motors can rotate independently, can also be used with constant speed motors. 
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6.3 Knowledge Base 

The knowledge base with which the MAS is working, is shown below. 

NA 

NAj.SU. STATE =(X), where X can have 7 states (ON, OFF, OPENNESS, 

CONSCIENTIOUSNESS, EXTROVERSION AGREEABLENESS, 

NEUROTICISM) 

NAi.SU.OBJ=(Y) where Y can have 2 states (DETECTS, NO DETECTS) 
NAi.SC. STATE = (Z) where Z can have 7 states (ON, OFF, OPENNESS, 

CONSCIENTIOUSNESS, EXTROVERSION AGREEABLENESS, 

NEUROTICISM) 
NAi.SC.TIMAGE 

NAi.SLI. STATE = (W) where W can have 2 states (ON, OFF) 
NAi.SLD. STATE = (V) where V can have 2 states (ON, OFF) 

7 Results 

We show in Figure 2 the plant and the controller that we are used for achieving the 
simulation in Simulink of Matlab. 
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Fig. 2. Plant for simulation 



Together with some fuzzy systems which were developed based on work taken 
as a reference [9,10] and that were the basis for the experimentation with different 
numbers of rules, parameters of membership functions, and types of membership 
functions was a good performance of our robot, achieving our goal. 
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The first input variable of the fuzzy system is the ultrasonic sensor which has three 
membership functions which are linguistic (close, near, far), as shown in Fig. 3. 




Fig. 3. The Ultrasonic sensor 

The second variable of the fuzzy system is the light sensor that has three 
membership functions, which are low, middle and high, as shown in Fig. 4. 




Fig. 4. The left light sensor 

The third variable of the fuzzy system is the light sensor that has three 
membership functions, which are low, middle and high, as shown in Fig. 5. 
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Fig. 5. The right light sensor 
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This is the first output variable is the speed of the left engine, has five 
membership functions as shown in Fig. 6. 
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Fig. 6. Motor left 
In Fig. 7 we show the rules that are used in our system, which are 10 fuzzy rules. 
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Fig. 7. Rules of the fuzzy system (FIS) 



An example of the results simulating this fuzzy system is shown in the figures 
8, 9 and 10, in which we see that as our robot moves forward in the path of the 
maze to find out achieved or failing not find it. 

Having modified and improved the fuzzy systems, we proceeded with a slight 
modification to our Multi-Agent System, which consists in adding a new agent 
which has the task of giving a second option in case our robot gets stuck 
somewhere in the world, for example corners in which our sensors cannot do some 
action, then come into action. The new agent called Support Agent enters into 
action as I mentioned in the case that the sensor values do not become active some 
rule of fuzzy systems that we throw the speed of the drive to get out of where we 
are stuck with only works with the task agent (TA) for which our multi-agent 
system would be as shown in Fig. 1 1 . 
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Fig. 8. Simulation- 11 duration of 01:03 minutes 
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Fig. 9. Simulation-22 duration of 01:30 minutes 
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Fig. 10. Simulation-3 duration of 00:58 minutes 
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Fig. 11. New scheme of Multi- Agent System 
Integrating it to our general scheme we would be as shown in Fig 12. 
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Fig. 12. New architecture of the complete system 

Now with these changes we had better control of movements of the robot in the 
search for the exit of the maze, which can be seen in the graphs of the following 
experiments, fig. 13,14,15. 
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Fig. 13. Simulation-1 duration of 00:55 minutes 




Fig. 14. Simulation-2 duration of 00:47 minutes 




Fig. 15. Simulation-3 duration of 00:47 minutes 



158 A.A. Garza, O. Castillo, and J.M. Garcia Valdez 

8 Conclusions 

The development of intelligent applications is varied, in this particular case, the 
implementation of multi-agent systems is complex because it consists of several 
agents, whit the incorporation of other techniques, making it even more complex. 

One of the main objectives of this research initially was to open a new area of 
intelligent agents using fuzzy logic, which hitherto has not been given the 
approach that is intended here, which gives a degree of utilization of this paradigm 
of intelligent agents, but more research is needed in order to defend this idea as 
innovative. 

Considering the non-optimized fuzzy systems, we have shown better results 
than were obtained in the work we take as a reference [14] for what we have 
achieved a good percentage of the overall project objectives and to improve not 
only the times but we have improved how it moves the robot by making our world 
more soft and therefore earns a bit of time helping to improve. 

And we gave another level to our Multi- Agent System as it not only succeeded in 
getting out of the labyrinth with which we have been working but we show results 
where the maze was changed in different ways and still finds the exit and not in a 
very long time. Indeed the average that has been obtained in the work of reference 
and is even a bit less here, so we are happy with the results we have achieved over not 
satisfied because we still have work to resolve because we have not yet optimized the 
system, with the hope that optimization can further improve the results obtained 
previously and perhaps even lower computation times are achieved. 
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Abstract. We describe in this paper an approach for mathematical function opti- 
mization using parallel computing combining Particle Swarm Optimization (PSO) 
and Genetic Algorithms (GAs) and Fuzzy Logic for parameter adaptation and in- 
tegrate the results. The parallel evolutionary method combines the advantages of 
parallel computing, PSO and GA to give us an improved parallel FPSO+FGA hy- 
brid method. Fuzzy Logic is used to combine the results of the PSO and GA in the 
best way possible. The parallel hybrid FPSO+FGA method was developed using a 
computer with processor Intel Core 2 Quad of 64 bits that works to a frequency of 
clock of 2.5 GHz, 6 GB of RAM Memory and Ubuntu Linux Operating System. 

Keywords: Parallel FPSO, FGA, GA. 
1 Introduction 

We describe in this paper a new parallel evolutionary method combining PSO and 
GA, to give us an improved FPSO+FGA hybrid method. We apply the parallel 
hybrid method to mathematical function optimization to validate the new approach 
using the parallel mode of the Matlab programming language. The application of a 
Genetic Algorithm (GA) and Particle Swarm Optimization (PSO) separately using 
parallel computing for the optimization of mathematical functions is shown in [5]. 
In this case, we are using a set of mathematical benchmark functions [4] [13] to 
compare the optimization results between a GA, PSO and FPSO+FGA. 

The main motivation of this method is to combine the characteristics of a GA 
and PSO in parallel mode using fuzzy logic. We are using several fuzzy systems 
to perform dynamical parameter adaptation. For decision making between the 
methods depending on the results that we are generating we are using one fuzzy 
system. The fuzzy system is used to decide and combine the outputs of both the 
GA and PSO, in this way obtaining the best solution to the optimization problem. 
Also, there are another two fuzzy systems to parameter adaptation in GA and 
PSO. The main goal of the fuzzy system is to evaluate the outputs of the GA and 

O. Castillo et al. (Eds.): Soft Computing for Intell. Control and Mob. Robot., SCI 318, pp. 16l l-178. 1 
springerlink.com © Springer- Verlag Berlin Heidelberg 2010 
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PSO in each generation and change if is necessary some important parameters. 
The criterion for stopping the method is the maximum number of generations. 

The paper is organized as follows: in section 2 a description about Genetic 
Algorithms for optimization problems is presented, in section 3 the Particle 
Swarm Optimization is presented, in section 4 the proposed method parallel 
FPSO+FGA and the fuzzy systems are described, in section 5 we can appreciate 
the mathematical functions that were used for this research, in section 6 the ex- 
perimental results are described, in section 7 the conclusions obtained after the 
study of the proposed method are presented. 

2 Genetic Algorithms for Optimization 

Holland, from the University of Michigan initiated his work on genetic algorithms 
at the beginning of the 1960s. His first achievement was the publication of 
Adaptation in Natural and Artificial System [7] in 1975. 

He had two goals in mind: to improve the understanding of natural adaptation proc- 
ess, and to design artificial systems having properties similar to natural systems [8]. 

The basic idea is as follows: the genetic pool of a given population potentially 
contains the solution, or a better solution, to a given adaptive problem. This solution 
is not "active" because the genetic combination on which it relies is split between 
several subjects. Only the association of different genomes can lead to the solution. 

Holland's method is especially effective because it not only considers the role of 
mutation, but it also uses genetic recombination, (crossover) [9]. The crossover of 
partial solutions greatly improves the capability of the algorithm to approach, and 
eventually find, the optimal solution. 

The essence of the GA in both theoretical and practical domains has been well 
demonstrated [1]. The concept of applying a GA to solve engineering problems is 
feasible and sound. However, despite the distinct advantages of a GA for solving 
complicated, constrained and multi-objective functions where other techniques may 
have failed, the full power of the GA in application is yet to be exploited [12] [14]. 

In figure 1 we show the reproduction cycle of the Genetic Algorithm. 



Offspring 



Population 




Fig. 1. The Reproduction cycle 
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The Simple Genetic Algorithm can be expressed in pseudo code with the 
following cycle: 

1. Generate the initial population of individuals aleatorily P(0). 

2. While (number _ generations <= maximum _ numbers _ generations) 

Do: 

I 
Evaluation; 
Selection; 
Reproduction; 
Generation ++; 

I 

3. Show results 
4. End 

3 Particle Swarm Optimization 

Particle swarm optimization (PSO) is a population based stochastic optimization 
technique developed by Eberhart and Kennedy in 1995, inspired by the social 
behavior of bird flocking or fish schooling [3], 

PSO shares many similarities with evolutionary computation techniques such as 
Genetic Algorithms (GA) [6]. The system is initialized with a population of ran- 
dom solutions and searches for optima by updating generations. However, unlike 
the GA, the PSO has no evolution operators such as crossover and mutation. In 
PSO, the potential solutions, called particles, fly through the problem space by 
following the current optimum particles [10]. 

Each particle keeps track of its coordinates in the problem space, which are as- 
sociated with the best solution (fitness) it has achieved so far (The fitness value is 
also stored). This value is called pbest. Another "best" value that is tracked by 
the particle swarm optimizer is the best value, obtained so far by any particle in 
the neighbors of the particle. This location is called Ibest. When a particle takes all 
the population as its topological neighbors, the best value is a global best and is 
called gbest. 

The particle swarm optimization concept consists of, at each time step, chang- 
ing the velocity of (accelerating) each particle toward its pbest and Ibest locations 
(local version of PSO). Acceleration is weighted by a random term, with separate 
random numbers being generated for acceleration toward pbest and Ibest locations 
[2]. 

In the past several years, PSO has been successfully applied in many research 
and application areas. It is demonstrated that PSO gets better results in a faster, 
cheaper way compared with other methods [11]. 

Another reason that PSO is attractive is that there are few parameters to adjust. 
One version, with slight variations, works well in a wide variety of applications. 
Particle swarm optimization has been used for approaches that can be used across 
a wide range of applications, as well as for specific applications focused on a 
specific requirement. 
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The pseudo code of the PSO is as follows 
For each particle 
Initialize particle 
End 
Do 

For each particle 
Calculate fitness value 

If the fitness value is better than the best fitness value (pBest) in history 
set current value as the new pBest 
End 

Choose the particle with the best fitness value of all the particles as the 
gBest 
For each particle 

Calculate particle velocity 
Update particle position 
End 
While maximum iterations or minimum error criteria is not attained 

4 Full Model of the Parallel FPSO+FGA 

The general approach of the proposed method FPSO+FGA can be seen in figure 2 
with a sequential approach. In figure 3 it can be seen the parallel method. The 
parallel approach of the method can be described as follows: 

1 . It receives a mathematical function to be optimized 

2. Four cores (called corel, core2, core3 and core4) are working in parallel 
mode to run the method. 

3. Each core has the work of run the FPSO+FGA, beginning at same time 
and sending the results to matlab client (view figure 3). 

4. The process in each core can be described as follow: 

a. It evaluates the role of both GA and PSO. 

b. A main fuzzy system is responsible for receiving values 
resulting from step 2. 

c. The main fuzzy system decides which method to take (GA or 
PSO) 

d. After, another fuzzy system receives the Error and DError as in- 
puts to evaluates if is necessary change the parameters in GA or 
PSO. 

e. There are 3 fuzzy systems. One is for decision making (is called 
main fuzzy), the second one is for changing parameters of the 
GA (is called fuzzyga) in this case change the value of crossover 
(kl) and mutation (k2) and the third fuzzy system is used to 
change parameters of the PSO (is called fuzzypso) in this case 
change the value of social acceleration (cl) and 
cognitive acceleration (c2). 

5. The main fuzzy system decides in the final step the optimum value for 
the function introduced in step 1 . 
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6. Repeat the above steps until the termination criterion of the algorithm is 
met. 

7. After, the results obtained by each core are sent to the matlab client. 

8. Finally the comparison results between all cores are made in the matlab 

client. 




Fig. 2. The FPSO+FGA scheme 
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Fig. 3. The parallel FPSO+FGA scheme 



The basic idea of the FPSO+FGA scheme is to combine the advantage of the indi- 
vidual methods using a fuzzy system for decision making and the others two fuzzy 
systems to improve the parameters of the GA and PSO when is necessary and 
parallel computing to obtain more results in a little time. 
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As can be seen in the proposed hybrid FPSO+FGA method, it is the internal 
fuzzy system structure, which has the primary function of receiving as inputs 
(Error and DError) the results of the FGA and FPSO outputs. The fuzzy system is 
responsible for integrating and decides which are the best results being generated 
at run time of the FPSO+FGA. It is also responsible for selecting and sending the 
problem to the "fuzzypso" fuzzy system when the FPSO is activated or to the 
"fuzzyga" fuzzy system when FGA is activated. Also activating or temporarily 
stopping depending on the results being generated. Figure 4 shows the member- 
ship functions of the main fuzzy system that is implemented in this method. The 
fuzzy system is of Mamdani type because it is more common in this type of fuzzy 
control and the defuzzification method is the centroid. In this case, we are using 
this type of defuzzification because in other papers we have achieved good results 
[4] . The membership functions are of triangular form in the inputs and outputs as 
is shown in figure 4. Also, the membership functions were chosen of triangular 
form based on past experiences in this type of fuzzy control. The fuzzy system 
consists of 9 rules. For example, one rule is if error is P and DError is P then 
best value is P (view figure 5). Figure 6 shows the fuzzy system rule viewer. 
Figure 7 shows the surface corresponding to this fuzzy system. The other two 
fuzzy systems are similar to the main fuzzy system. 
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Fig. 4. Fuzzy system membership functions 




Fig. 5. Fuzzy system rules 
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Fig. 6. Fuzzy system rules viewer 




Fig. 7. Surface of fuzzy system 



4.1 FPSO (Fuzzy Particle Swarm Optimization) 

This section presents a detailed description of the FPSO model. The classical rep- 
resentation scheme for GAs is binary vectors of fixed length. In the case of an 
n x - dimensional search space, each individual consists of n x variables with each 
variable encoded as a binary string. 

The swarm is typically modeled by particles in multidimensional space that 

have a position and a velocity. These particles fly through hyperspace (i.e., R" ) 
and have two essential reasoning capabilities: their memory of their own best posi- 
tion and knowledge of the global or their neighborhood's best. In a minimization 
optimization problem, "best" simply meaning the position with the smallest objec- 
tive value. Members of a swarm communicate good positions to each other 
and adjust their own position and velocity based on these good positions. So a par- 
ticle has the following information to make a suitable change in its position and 
velocity: 

A global best that is known to all and immediately updated when a new best 
position is found by any particle in the swarm. 

The neighborhood best that the particle obtains by communicating with a subset 
of the swarm. 
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The local best, which is the best solution that the particle has seen. 

In this case, the social information is the best position found by the swarm, 

A 

referred as y (t). For gbest FPSO, the velocity of particle i is calculated as 

A 

v.. (t + 1) = vij (t) + cfij (t)[ y v (t) - x tj (0] + c 2 r 2 . (t)[ y . (t) - x tj (t)] (1) 

Where vi At) is the velocity of particle i in dimension j = l,...,n x at time step t, 

xi- (t) is the position of particle i in dimension j at time step t, Ci and C 2 repre- 
sents the cognitive and social acceleration. In this case, these values are fuzzy 
because they are changing dynamically when the FPSO is running, and r { (t), 

r 2 ~U(0,1) are random values in the range [0,1]. 

4.2 FGA (Fuzzy Genetic Algorithm) 

This section presents a detailed description of the FGA. Several crossover opera- 
tors have been developed for GAs, depending on the format in which individuals 
are represented. For binary representations, uniform crossover, one point cross- 
over and two points cross over are the most popular. In this case we are using two 
points crossover with fuzzy crossover rate because we are adding a fuzzy system 
called 'fuzzyga' that is able of change the crossover and mutation rate. 

4.3 Definition of the Fuzzy Systems Used in FPSO+FGA 

'fuzzypso': In this case we are using a fuzzy system called 'fuzzypso', and 
the structure of this fuzzy system is as follow: 
Number of Inputs: 2 
Number of Outputs: 2 
Number of membership functions: 3 
Type of the membership functions: Triangular 
Number of rules: 9 
Defuzzification: Centroid 

The main function of the fuzzy system called 'fuzzypso' is to adjust the 
parameters of the PSO. In this case, we are adjusting the following parameters: 
'cf and 'c 2 ' ; where: 
'Ci' = Cognitive Acceleration 
'c 2 ' = Social Acceleration 

We are changing these parameters to test the proposed method. In this case, with 
'fuzzypso' is possible to adjust in real time the 2 parameters that belong to the 
PSO. 
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'fuzzyga': In this case we are using a fuzzy system called 'fuzzyga', the structure 
of this fuzzy system is as follows: 
Number of Inputs: 2 
Number of Outputs: 2 
Number of membership functions: 3 
Type of membership functions: Triangular 
Number of rules: 9 
Defuzzification: Centroid 

The main function of the fuzzy system called 'fuzzypso' is to adjust the 
parameters of the GA. In this case, we are adjusting the following parameters: 
'ki', 'k 2 '; where: 
'k,' = mutation 
'k 2 ' = crossover 

'fuzzymain': In this case, we are using a fuzzy system called 'fuzzymain'. The 
structure of this fuzzy system is as follows: 
Number of Inputs: 2 
Number of Outputs: 1 
Number of membership functions: 3 
Type of membership functions: Triangular 
Number of rules: 9 
Defuzzification: Centroid 

The main function of the fuzzy system, called 'fuzzymain' is to decide on the best 
way for solving the problem, in other words if it is more reliable to use the FPSO 
or FGA. This fuzzy system is able to receive two inputs, called error and derror, it 
is to evaluate the results that are generated by FPSO and FGA in the last step of 
the algorithm. 

5 Benchmark Mathematical Functions 

To validate our method we used a set of 5 benchmark mathematical functions; all 
functions were evaluated in parallel mode with different numbers of variables, 
in this case, the experimental results were obtained with 16, 32, 64 and 128 
variables. 

5.1 De Jong's Function 1 

The simplest test function is De Jong's function 1. It is also known as sphere 
model, and is defined by this equation: 



f x — £, X n minimum = 0, for -oo<x<oo /j\ 

n-i 

The visualization of this function it can be seen in figure 8. 
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Fig. 8. De Jong's function 
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Fig. 9. Rotated hyper-ellipsoid function 

5.2 Rotated Hyper-ellipsoid Function 

The visualization of Rotated hyper-ellipsoid function; surf/mesh plot of the first 
two variables in an area from -50 to 50 it can be seen in figure 9, and is defined by 
this equation: 
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/(*)=z[i>j 



minimum = 0, for -65.536<xi<65.536 



(3) 



5.3 Rosenbrock's Valley (De Jong's Function 2) 

The visualization of Rosenbrock's function can be seen in figure lO.Rosenbrock's 
valley is a classic optimization problem, also known as Banana function. The 
global optimum is inside a long, narrow, parabolic shaped flat valley. The conver- 
gence to the global optimum is difficult and hence this problem has been repeat- 
edly used in assess the performance of optimization algorithms. The equation is 
defined by: 



«-] 



f(x) = ^00.(x i+l -xf) +(l-x,) 2 



(4) 



Minimum = 0, for-2.048<xi<2.048 



4000 




variable 2 



-2 -2 
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Fig. 10. Rosenbrock's valley function 

5.4 Rastrigin's Function 

Rastrigin's function is based on function 1 with the addition of cosine modulation 
to produce many local minima. Thus, the test function is highly multimodal. The 
equation is defined by: 
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/(x) = 10.n + ^(x, 2 -10.cos(2^x,.)) 



minimum = 0, for -oo<x<oo 



(5) 




variable 2 



Fig. 11. Rastrigin's function 



5.5 Grie wanks Function 



Griewank's function is similar to Rastrigin's function. In figure 12 we can see the 
visualization for this function. It has many widespread local minima. However, the 
location of the minima are regularly distributed. The equation is defined by this 
equation: 



X, 



fix) = y — - — cos 

tT 4000 



r 3? 



+1 



(6) 



minimum = 0, for -oo<x<co 



Fuzzy Control for Dynamical Parameter Adaptation 



173 




variable 2 



-SO -SO 





variable 1 
Fig. 12. Griewank's functions 



6 Experimental Results 

Several tests of the FPSO+FGA algorithms were made with an implementation in 
the Matlab programming language. 

The implementation was developed using a computer with processor Intel Core 
2 Quad of 64 bits that works to a frequency of clock of 2.5 GHz, 6 GB of RAM 
Memory and Ubuntu Linux Operating System. The results obtained after applying 
the GA to the mathematical functions are shown in tables 1 to 10: 

The parameters used in the Tables are: 
V= Number of variables used to evaluation. 
BEST VALUE= The best result obtained. 
AVERAGE= The average of 50 times in each core. 
WORST VALUE= The worst result obtained. 

The parameters used initially on all tests with the GA were: population size = 100 
individuals, crossover (k]) = 80%, mutation (k 2 ) = 5%, selection= roulette. On 
PSO the parameters were: swarm size= 100 particles, cognitive acceleration 
(cl) = 1, social acceleration (c2) = 0.5, value of velocity at the beginning = 0.95, 
constriction factor = 1. On execution time in FPSO+FGA the parameters were 
changing for k b k 2 , Cj and c 2 , this four parameters were fuzzy parameters, because 
were obtained dynamically when the method is running, the fuzzy systems can be 
obtaining this four parameters the best way for achieve good results. 
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6.1 Experimental Results with FPSO+FGA 

The results obtained after applying the proposed method FPSO+FGA to the mathe- 
matical functions are shown in tables 1 to 10. One simulation results is shown in 
figure 13 in parallel mode. The use of CPU is shown in figure 14, we can observe 
when the method is running the four CPUs are working the maximum. 
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Fig. 13. Simulations results for one test function in parallel mode 
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Fig. 14. CPU usage 
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Table 1 shows the experimental results for the De Jong's function. The table 
shows the number of variables for the different evaluations; it can be seen the best 
and worst value obtained, the average of 50 times after executing the method by 
each core. The time of execution by each core is shown in table 2. The minimum 
value for this function is in 0, however, we can see in the table 3, the best values 
obtained with the proposed method; we can observe when the number of variables 
is increased, is more complex by the method to obtain good results when the pa- 
rameters are statics, therefore, the adjust of parameters is necessary in this cases 
and the proposed method achieved better results because the parameters are 
changed in execution time. 

Table 1, Experimental results with FPSO+FGA for the De Jong's function 



V 


BEST 

VALUE 


AVERAGE 


WORST 
VALUE 






Corel 


Core2 


Core3 


Core4 




16 


0.00001 
(Corel) 


0.0002 


0.1874 


0.0001 


0.0034 


0.9205 
(Core2) 


32 


0.0299 
(Core3) 


0.0148 


0.0808 


0.0458 


0.0111 


1.5926 
(Core2) 


64 


0.0123 
(Core2) 


0.2529 


0.3346 


0.9649 


0.3856 


1.79 
(Core3) 


128 


0.1681 
(Core3) 


0.6030 


0.6703 


0.2060 


0.2954 


2.089 
(Core2) 



Table 2. Time in minutes (De Jong's function) 



V 


Time in minutes 


Corel 


Core2 


Core3 


Core4 


16 


0.8 


0.8 


0.7 


0.8 


32 


1.11 


1.11 


1.10 


1.11 


64 


2.01 


2.01 


2.01 


2.011 


128 


3.05 


3.06 


3.07 


3.07 



Table 3. Experimental results with FPSO+FGA for the hyper-ellipsoid function 



V 


BEST 

VALUE 


AVERAGE 


WORST 
VALUE 






Corel 


Core2 


Core3 


Core4 




16 


0.0271 
(Corel) 


0.3774 


0.7951 


0.2905 


0.2784 


1.4401 
(Corel) 


32 


0.0690 
(Core2) 


0.9970 


1.3363 


1.3798 


1.0722 


8.4104 
(Core3) 


64 


2.1667 
(Corel) 


4.0698 


5.8161 


4.0255 


4.9087 


42.872 
(Core4) 


128 


3.0999 

(Core2) 


7.7459 


6.3345 


4.9908 


8.9876 


78.09 
(Core2) 
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Table 4. Time in minutes (hyper-ellipsoid function) 



V 


Time in minutes 


Corel 


Core2 


Core3 


Core4 


16 


0.36 


0.38 


0.38 


0.38 


32 


1.16 


1.20 


1.22 


1.16 


64 


1.41 


1.39 


1.40 


1.42 


128 


3.01 


3.03 


3.04 


3.04 



In table 5 it can be seen the simulations results for the Rosenbrock's valley 
function. Table 6 show the time of execution for this test function in parallel mode 

Table 5. Experimental results with FPSO+FGA for the Rosenbrock's function 



V 


BEST 

VALUE 


AVERAGE 


WORST 
VALUE 






Corel 


Core2 


Core3 


Core4 




16 


0.0110 
(Core3) 


0.0456 


0.0234 


0.0768 


0.0925 


2.3001 
(Core2) 


32 


0.3422 
(Corel) 


1.0023 


1.5631 


1.6505 


1.0994 


9.8790 
(Core4) 


64 


2.534 
(Core2) 


3.0568 


3.4456 


4.5673 


5.9807 


7.7765 
(Core3) 


128 


2.9909 
(Core2) 


8.3456 


7.8956 


6.0676 


6.7878 


9.0456 
(Core2) 



Table 6. Time in minutes(Rosenbrock's function) 



V 


Time in minutes 


Corel 


Core2 


Core3 


Core4 


16 


1.02 


1.05 


1.04 


1.07 


32 


1.20 


1.20 


1.23 


1.25 


64 


1.55 


1.55 


1.57 


1.59 


128 


3.50 


3.49 


3.47 


3.50 



In table 7, we can appreciate the simulation results in parallel mode for the 
Rastrigin's function. In Table 8, we show the time of execution of the method for 
this test function. 



Table 7. Experimental results with FPSO+FGA for the Rastrigin's function 



V 


BEST 

VALUE 


AVERAGE 


WORST 
VALUE 






Corel 


Core2 


Core3 


Core4 




16 


0.0110 
(Corel) 


0.0065 


0.0098 


0.0076 


0.0045 


2.3001 
(Core3) 


32 


0.0012 
(Corel) 


1.0023 


0.9988 


0.9760 


0.7890 


3.3345 
(Core4) 


64 


0.9766 
(Corel) 


2.6777 


2.3455 


2.3443 


2.3465 


5.6666 
(Core4) 


128 


1.01 

(Corel) 


3.4560 


3.4433 


3.6678 


4.0001 


10.098 
(Core3) 
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Table 8. Time in minutes (Rastrigin's function) 



V 


Time in minutes 


Corel 


Core2 


Core3 


Core4 


16 


1.01 


1.01 


1.02 


1.05 


32 


1.30 


1.31 


1.23 


1.26 


64 


1.58 


1.59 


2.00 


2.01 


128 


3.30 


3.31 


3.32 


3.30 



In table 9, we can appreciate the simulation results in parallel mode for the 
Griewank's function. In Table 10, we show the time of execution of the method 
for this test function. 

Table 9. Experimental results with FPSO+FGA for the Griewank's function 



V 


BEST 

VALUE 


AVERAGE 


WORST 
VALUE 






Corel 


Core2 


Core3 


Core4 




16 


0.0345 
(Corel) 


0.3333 


0.3345 


0.7768 


0.9980 


1.0989 
(Core3) 


32 


0.1045 
(Corel) 


1.9800 


0.9761 


0.8801 


0.8809 


4.5678 
(Core4) 


64 


0.9981 
(Corel) 


2.0983 


2.0987 


2.0967 


2.2345 


6.4561 
(Core4) 


128 


1.5567 
(Corel) 


4.3245 


5.7891 


4.8907 


5.6789 


12.980 
(Core3) 



Table 10. Time in minutes (Griewank's function) 



V 


Time in minutes 


Corel 


Core2 


Core3 


Core4 


16 


1.01 


1.01 


1.02 


1.05 


32 


1.30 


1.31 


1.23 


1.26 


64 


1.58 


1.59 


2.00 


2.01 


128 


3.30 


3.31 


3.32 


3.30 



7 Conclusions 



The analysis of the experimental results of the evolutionary method considered in 
this paper, parallel FPSO+FGA lead us to the conclusion that for the optimization 
of this benchmark mathematical function with this method is a good alternative 
because it is easier and very fast to optimize achieve good results than to try it 
with PSO or GA separately [5]. This is, because the combination PSO and GA 
with fuzzy rules gives a parallel hybrid method FPSO+FGA. Also, we can observe 
the method is very fast for the execution, with architecture of four cores, we can 
generate four results in the same time. 



178 F. Valdez, P. Melin, and O. Castillo 

Acknowledgment 

We would like to express our gratitude to the CONAC YT, Institute of Technology 
for the facilities and resources granted for the development of this research. 

References 

[1] Man, K.F., Tang, K.S., Kwong, S.: Genetic Algorithms: Concepts and Designs. 
Springer, Heidelberg (1999) 

[2] Eberhart, R.C., Kennedy, J.: A new optimizer using particle swarm theory. In: 
Proceedings of the Sixth International Symposium on Micromachine and Human 
Science, Nagoya, lapan, pp. 39—43 (1995); Lu, I.-G.: Title of paper with only the 
first word capitalized. J. Name Stand. Abbrev. (in press) 

Kennedy, I., Eberhart, R.C.: Particle swarm optimization. In: Proceedings of IEEE 
International Conference on Neural Networks, Piscataway, NJ, pp. 1942-1948 
(1995) 

Holland, J.H.: Adaptation in natural and artificial system. The University of 
Michigan Press, Ann Arbor (1975) 

Valdez, F., Melin, P.: Parallel Evolutionary Computing using a cluster for Mathe- 
matical Function Optimization, Nafips. San Diego CA, USA, 598-602 (lune 2007) 
Castillo, O., Melin, P.: Hybrid intelligent systems for time series prediction using 
neural networks, fuzzy logic, and fractal theory. IEEE Transactions on Neural 
Networks 13(6), 1395-1408 (2002) 

Fogel, D.B.: An introduction to simulated evolutionary optimization. IEEE transac- 
tions on neural networks 5(1), 3-14 (1994) 

Goldberg, D.: Genetic Algorithms. Addison Wesley, Reading (1988) 
Emmeche, C: Garden in the Machine. The Emerging Science of Artificial Life, 
p. 114. Princeton University Press, Princeton (1994) 

Angeline, P.J.: Using Selection to Improve Particle Swarm Optimization. In: 
Proceedings 1998 IEEE World Congress on Computational Intelligence, Anchorage, 
Alaska, pp. 84-89. IEEE, Los Alamitos (1998) 

Angeline, P. J.: Evolutionary Optimization Versus Particle Swarm Optimization: 
Philosophy and Performance Differences. In: Porto, V.W., Waagen, D. (eds.) 
EP 1998. LNCS, vol. 1447, pp. 601-610. Springer, Heidelberg (1998) 
Back, T., Fogel, D.B., Michalewicz, Z. (eds.): Handbook of Evolutionary Computa- 
tion. Oxford University Press, Oxford (1997) 

Montiel, O., Castillo, O., Melin, P., Rodriguez, A., Sepulveda, R.: Human evolution- 
ary model: A new approach to optimization. Inf. Sci. 177(10), 2075-2098 (2007) 
Castillo, O., Valdez, F., Melin, P.: Hierarchical Genetic Algorithms for topology op- 
timization in fuzzy control systems. International lournal of General Systems 36(5), 
575-591 (2007) 



[3 

[4; 

[5 

[6; 

[7 

[8 
[9 

[10 

[11 

[12 
[13 

[14; 



Part III 
Optimization of Fuzzy Controllers 



Optimization of Type-2 Fuzzy Logic Controllers Using 
PSO Applied to Linear Plants 

Ricardo Martinez 1 , Oscar Castillo 2 , Luis T. Aguilar 3 , and Antonio Rodriguez 1 

1 Universidad Autonoma de Baja California Tijuana, Mexico 

mc . ricardo . martinez@hotmail 
~ Tijuana Institute of Technology, Tijuana Mexico 

ocastillo@tecti juana.mx 
3 Instituto Politecnico Nacional, Centro de Investigacion y Desarrollo de Tecnologia 

Digital, Ave. Del Parque 1310, Mesa de Otay, Tijuana B.C. 22510 

luis . aguilar@ieee . org 



Abstract. We use the Particle Swarm Optimization (PSO) method to find the 
parameters of the membership functions of a type-2 fuzzy logic controller (Type-2 
FLC) in order to minimize the state error for linear systems. PSO is used to find 
the optimal Type-2 FLC to achieve regulation of the output and stability of the 
closed-loop system. For this purpose, we change the values of the cognitive, social 
and inertia variables in the PSO. Simulation results, with the optimal FLC 
implemented in Simulink, show the feasibility of the proposed approach. 

Keywords: PSO, Fuzzy Logic Optimizations. 



1 Introduction 

Optimization algorithms are search methods, where the goal is to find a solution to 
an optimization problem, such that a given quantity is optimized, possibly subject 
to a set of constraints [6]. Some optimization methods are based on populations of 
solutions. Unlike the classic methods of improvement for trajectory tracking, in 
this case each iteration of the algorithm has a set of solutions. These methods are 
based on generating, selecting, combining and replacing a set of solutions. Since 
they maintain and they manipulate a set, instead of a unique solution throughout 
the entire search process, they used more computer time than other metaheuristic 
methods. This fact can be aggravated because the "convergence" of the population 
requires a great number of iterations. For this reason a concerted effort has been 
dedicated to obtaining methods that are more aggressive and manage to obtain 
solutions of quality in a nearer horizon. This paper is concerned with bio-inspired 
optimization methods like particle swarm optimization (PSO) to design optimized 
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fuzzy logic controllers (FLC) for linear problems. This method is used to find 
the parameters of the membership functions obtaining the optimal FLC for plant 
control. 

This paper is organized as follows: Section 2 presents the theoretical basis and 
problem statement. Section 3 introduces the controller design where a PSO is used 
to select the parameters. Robustness properties of the closed-loop system are 
achieved with a type-2 fuzzy logic control system using a Takagi-Sugeno model 
where the error and the change of error, are considered the linguistic variables. 
Section 4 provides a simulation study of the plant using the controller described in 
Section 3. Finally, Section 5 presents the conclusion. 

2 Theoretical Basis and Problem Statement 

Particle Swarm Optimization (PSO). PSO is a population based stochastic 
optimization technique developed by Eberhart and Kennedy in 1995, inspired by 
social behavior of bird flocking or fish schooling [5]. PSO shares many similari- 
ties with evolutionary computation techniques such as Genetic Algorithms 
(GA)[7]. 

The system is initialized with a population of random solutions and searches 
for optima by updating generations. However, unlike the GA, the PSO has no evo- 
lution operators such as crossover and mutation. In the PSO, the potential solu- 
tions, called particles, fly through the problem space by following the current 
optimum particles [2], Each particle keeps track of its coordinates in the problem 
space, which are associated with the best solution (fitness) it has achieved so far 
(The fitness value is also stored). This value is called pbest. Another "best" value 
that is tracked by the particle swarm optimizer is the best value, obtained so far by 
any particle in the neighbors of the particle. This location is called Ibest. When a 
particle takes all the population as its topological neighbors, the best value is a 
global best and is called gbest [10]. 

The particle swarm optimization concept consists of, at each time step, chang- 
ing the velocity of (accelerating) each particle toward its pbest and Ibest locations 
(local version of PSO). Acceleration is weighted by a random term, with separate 
random numbers being generated for acceleration toward pbest and Ibest locations 
[9]. In the past several years, PSO has been successfully applied in many research 
and application areas. It is demonstrated that PSO gets better results in a faster, 
cheaper way compared with other methods [1], [9]. Another reason that PSO is 
attractive is that there are few parameters to adjust. One version, with slight varia- 
tions, works well in a wide variety of applications. Particle swarm optimization 
has been used for approaches that can be used across a wide range of applications, 
as well as for specific applications focused on a specific requirement [9] . 
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The basic algorithm of the PSO has the next nomenclature: 

i 

Y -Particle position 
y -Particle velocity 

i 

n -Best "remembered" individual particle position 

g 
T) -Best "remembered" swarm position 

p ,p -Cognitive and Social parameters 

y , y -Random numbers between and 1 
The equation for calculate the velocity is: 



v k+ rv k + c l r l 



g 



p-x k + c 2 r 2 p-x, 



(i) 



and the position of individuals particles is updated as follows: 



Xk+l Xk Vk+l 

The basic PSO algorithm flow diagram as follows: 

1) Initialize 

a) Set constants h- , n , n 

b) Randomly initialize particle position xn e D in R n for i — 1,..., p 

c) Randomly initialize particle velocities < vn — Vr? f° r 

i = \,...,p 

d) Set k = 1 

2) Optimize 

a) Evaluate function value J using design space coordinates -y 

b) If f < f then f = f , T) = r \ . 

J k J best J best J k f k • A/k 

/' pg pg pi g i 

< f then / = f ,n = y, 
k J best J best J k f k ■ A/k 

d) If stopping condition is satisfied then goto 3. 

e) Update all particle velocities -y fori — I,..., p 

f) Update al particle positions y iori — l,...,p 

g) Increment k. 
h) Goto 2(a). 
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3) Terminate 

Problem Statement. To test the optimized FLCs obtained by the bio-inspired 
methods; we used different linear systems. We first consider two benchmark prob- 
lems called Plant 1 and Plant 2 with different levels of complexity. Fig 1 shows 
the Simulink bloc diagram used to simulate the Plants. 

Plant 1 is given by the following second order transfer function: 



g(s) = 



2 



s + 2e Wn s + w n 



2 e = 0.5, Wn = 2 (3) 



where Wn is the natural frequency and £ is the coefficient damping. 
Plant 2 is given by the following transfer function: 



; 2 + 4 



(4) 
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Fig. 1. Simulink bloc diagram of the Plant 1 and Plant 2 



3 Fuzzy Logic Control Design 

In this section we design a fuzzy logic controller (FLC) where the optimal control- 
ler was found with the first evolutionary method, which in this case is the genetic 
algorithm. 

For the FLC a Takagi-Sugeno type of fuzzy system is used with two inputs a) 
error, and b) change of error, with three membership functions each input, "Nega- 
tive, Zero and Positive" (Gaussian and triangular), one output which are constant 
values, and nine fuzzy rules (IF - THEN) [3],[4],[8],[11],[12],[13],[14]. Fig 2 
shows the FLC membership functions for the plant control. Once we obtained the 
FLC design, we used an optimization method to find the optimal Controller. 
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Fig. 2. a) input 1 "error", b) input 2 "error change" 

Table 1 show the parameters of the membership functions, the minimal and the 
maximum values in the search range for the PSO method to find the best type-1 
fuzzy controller system and table 2 shows the parameters of the membership 
function of the type-2 FLC. 

Table 1. Parameters of the membership functions for type-1 FLC 



Plant 1 


Plant 2 


MFType 


Point 


Minimum 
Value 


Maximum 
Value 


MFType 


Point 


Minimum 
Value 


Maximum 
Value 


Gaussian 


a 


0.3 


0.6 


Gaussian 


a 


1.8 


2.8 


b 


-1.2 


-0.8 


b 


-6 


-4 


Triangular 


a 


-0.8 


-0.3 


Triangular 


a 


-3 


-0.5 


b 








b 








c 


0.3 


0.8 


c 


0.5 


3 


Gaussian 


a 


0.3 


0.6 


Gaussian 


a 


1.8 


2.8 


b 


0.8 


1.2 


b 


4 


6 



Table 2. Parameters of the membership functions for type-2 FLC 



Plant 1 


Plant 2 


MFType 


Point 


Minimum 
Value 


Maximum 
Value 


MFType 


Point 


Minimum 
Value 


Maximum 
Value 


Gaussian 


a 


0.13 


0.14 


Gaussian 


a 


1.8 


2.8 


b 


-1.2 


-1 


b 


-7 


-3 


a' 


0.22 


0.43 


a' 


1.5 


1.6 


b' 


-1.2 


-1 


b' 


-7 


-3 


Triangular 


a 


-1 


-0.75 


Triangular 


a 


-3 


-2.5 


b 








b 








c 


0.15 


0.25 


c 


0.5 


2 


a' 


-0.75 


-0.25 


a' 


-2 


-1 


b' 








b' 








c' 


0.5 


0.75 


c' 


1 


3 


Gaussian 


a 


0.13 


0.14 


Gaussian 


a 


1.8 


2.8 


b 


1 


1.2 


b 


3 


7 


a' 


0.22 


0.43 


a' 


1.5 


1.6 


b' 


1 


1.2 


b' 


3 


7 
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4 Simulation Results 

In this section, we evaluate, through computer simulations performed in 
MATLAB® and SIMULINK®, the designed FLC for the two plants using the 
PSO optimization method. We use random values in the cognitive (CI), social 
(C2) and inertia parameters of the PSO. 

4.1 PSO with Type-1 FLC for Plant 1 

Table 3 presents the main results of the FLC obtained with the PSO showing in 
the first row the best result. 

Table 3. Results of the Type-1 FLC for Plant 1 obtained by PSO 



No. 


Swarm 
(Population) 


Max 
Iterations 


CI 


C2 


Inertia 


Time exec 


Average 
error 


1 


200 


70 


0.4542 


0.5052 


0.9038 


0:26:26 


0.081179 


2 


200 


70 


0.9667 


0.3883 


0.8501 


0:44:46 


0.086795 


3 


200 


70 


0.183 


0.8722 


0.9289 


0:09:23 


0.087324 


4 


200 


70 


0.6798 


0.7371 


0.6119 


0:54:22 


0.089289 


5 


200 


70 


0.0016 


0.8994 


0.6983 


0:45:59 


0.090177 


6 


200 


70 


0.484 


0.704 


0.7194 


0:49:16 


0.092952 


7 


200 


70 


0.7046 


0.9151 


0.9988 


0:05:59 


0.095280 


8 


200 


70 


0.1344 


0.3959 


0.2534 


0:58:29 


0.096134 


9 


200 


70 


0.1223 


0.5346 


0.0681 


0:57:48 


0.103938 


10 


200 


70 


0.1589 


0.0366 


0.6223 


1:04:45 


0.109765 



Fig 3 shows the Particles behavior of PSO giving the best type-1 FLC for con- 
trolling the plant 1 and fig 4 shows the membership functions of the optimized 
controller. 
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Fig. 3. Behavior of PSO particles of Type-1 FLC optimization 



Optimization of Type-2 Fuzzy Logic Controllers Using PSO 



187 




Fig. 4. Input a "error" and input b "errordev" of the optimized FLC 

The simulations result of the optimized Type-1 FLC for Plant 1 is showing in 
the fig 5. 
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Fig. 5. Simulations result of type-1 FLC for Plant 1 

4.2 PSO with Type-1 FLC for Plant 2 

Table 4 presents the main results of the FLC obtained with the PSO showing in 
the first row the best result. 
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Table 4. Results of the Type-1 FLC for Plant 2 obtained by PSO 



No. 


Swarm 
(Population) 


Max 
Iterations 


CI 


C2 


Inertia 


Time exec 


Average 
error 


1 


200 


70 


0.0429 


0.6746 


0.8477 


0:34:32 


0.131933 


2 


200 


70 


0.7580 


0.1716 


0.5573 


0:49:17 


0.136293 


3 


200 


70 


0.7488 


0.0884 


0.9051 


0:39:05 


0.141524 


4 


200 


70 


0.8149 


0.9059 


0.1706 


0:49:38 


0.148655 


5 


200 


70 


0.0660 


0.7696 


0.5558 


0:44:32 


0.150427 


6 


200 


70 


0.8660 


0.1036 


0.5554 


1:05:25 


0.151731 


7 


200 


70 


0.5543 


0.6067 


0.4974 


0:50:25 


0.152046 


8 


200 


70 


0.7154 


0.2479 


0.5130 


0:55:45 


0.154302 


9 


200 


70 


0.0158 


0.0882 


0.2155 


1:02:43 


0.160239 


10 


200 


70 


0.4458 


0.9566 


0.3233 


0:51:22 


0.164745 



Fig 6 shows the particles behavior of PSO giving the best type-1 FLC for con- 
trolling the plant 2 and fig. 7 shows the membership functions of the optimized 
controller. 
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Fig. 6. Behavior of PSO particles of Type-1 FLC optimization 




Fig. 7. Input a "error" and input b "errordev" of the optimized FLC 
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The simulation result of the optimized Type-1 FLC for Plant 2 is shown in fig 8. 
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Fig. 8. Simulations result of type- 1 FLC for Plant 2 

4.3 PSO with Type-2 FLC for Plant 1 

Table 5 presents the main results of the FLC obtained with PSO showing in the first 
row the best result. 

Table 5. Results of the Type-2 FLC for Plant 1 obtained by PSO 



No. 


Swarm 
(Population) 


Max 
Iterations 


CI 


C2 


Inertia 


Time exec 


Average 
error 


1 


200 


90 


0.5149 


0.3317 


0.6808 


7:05:20 


0.08028 


2 


200 


70 


0.8149 


0.9059 


0.1706 


12:44:19 


0.08794 


3 


200 


70 


0.8129 


0.8159 


0.1906 


10:52:09 


0.08894 


4 


200 


70 


0.7646 


0.9229 


0.1096 


9:50:45 


0.12521 


5 


200 


70 


0.8168 


0.9359 


0.4806 


11:05:43 


0.12630 



Fig 9 shows the particles behavior of PSO giving the best type-2 FLC for control- 
ling the plant 1 and fig 10 shows the membership functions of the optimized controller. 
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Fig. 9. Behavior of PSO particles of Type-2 FLC optimization 
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Fig. 10. Input a "error" and input b "errordev" of the optimized FLC 
The simulation result of the optimized Type-2 FLC for Plant 1 is shown in fig 11. 
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Fig. 11. Simulations result of type-2 FLC for Plant 1 



4.4 PSO with Type-2 FLC for Plant 2 

Table 6 presents the main results of the FLC obtained with PSO showing in the 
first row the best result. 

Table 6. Results of the Type-2 FLC for Plant 2 obtained by PSO 



No. 


Swarm 
(Population) 


Max 
Iterations 


CI 


C2 


Inertia 


Time exec 


Average 
error 


1 


200 


90 


0.5149 


0.3317 


0.6808 


7:05:20 


0.08028 


2 


200 


70 


0.8149 


0.9059 


0.1706 


12:44:19 


0.08794 


3 


200 


70 


0.8129 


0.8159 


0.1906 


10:52:09 


0.08894 


4 


200 


70 


0.7646 


0.9229 


0.1096 


9:50:45 


0.12521 


5 


200 


70 


0.8168 


0.9359 


0.4806 


11:05:43 


0.12630 



Fig 12 shows the Particles behavior of PSO giving the best type-2 FLC for control- 
ling the plant 1 and fig 13 shows the membership functions of the optimized controller. 
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Fig. 12. Behavior of PSO particles of Type-2 FLC optimization 
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Fig. 13. Input a "error" and input b "errordev" of the optimized FLC 
The simulation result of the optimized Type-2 FLC for Plant 2 is shown in fig 14. 
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Fig. 14. Simulations result of type-2 FLC for Plant 2 
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5 Conclusions 

We described in this paper the use of an optimization method for the simulation 
design of optimized FLCs. In particular we presented results for PSO in Type-1 
and Type-2 FLC optimization for linear plants. The simulation results show that 
the closed-loop system with the type-1 FLC obtained is stabilized in less than 10 
sec. On the other hand, the Type-2 FLCs obtained are better than the Type-1 
FLCs because they gets stabilization in less than 5 seconds, but with more over- 
shoot in the second Plant that the first one. The plots of the results show this 
difference. 

We have achieved satisfactory results with PSO; the next step is to solve the 
problem using Type-2 FLC in a perturbed environment and considering multiple 
objective optimization to obtain better results. Moreover, we will extend the 
results to nonlinear systems, like for autonomous mobile robots. 
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Abstract. This paper proposes the optimization of an incremental fuzzy PD 
controller based on a genetic algorithm, which optimizes the parameters of 
membership functions of the inputs and outputs. The multiobjective GA is 
considered because it evaluates three characteristics of the controller (steady state 
error, overshoot and undershoot). The application of this controller is to regulate 
the engine speed of a direct current (DC) motor. A methodology to test and 
validate this controller through Matlab-Simulink is presented. 

1 Introduction 

Fuzzy logic controllers are used successfully in many application areas, these 
include control, classification, etc. [1,2,3,5, 7,10,11]. These systems based on rules 
incorporate linguistic variables, linguistic terms and fuzzy rules. The acquisition 
of these is not an easy task for the expert and are of vital importance in the 
operation of the controller. 

Adjusting these linguistic terms and rules are usually done by trial and error, 
which implies a difficult task, there are methods to optimize those elements that 
over time have taken importance, such as genetic algorithms [6], 

This work presents the optimization of membership functions for incremental 
Fuzzy PD control based on genetic algorithms, the controller regulates the engine 
speed of DC Motor, and this application is tested in Matlab-Simulink. 

This paper is organized as follows, in section 2 we present an introductory ex- 
planation of Fuzzy Inference Systems and Genetic algorithms, section 3 describes 
the design and optimization of the fuzzy system, the experimental results of the 
optimized FLC are shown in Section 4. Finally, Section 5 presents the conclusions. 
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2 Preliminaries 

Fuzzy systems are every time used more and more, because they tolerate 
imprecise information and can be used to model nonlinear functions of arbitrary 
complexity. A fuzzy system (FIS) consists of three stages: Fuzzification, Inference 
and Defuzzification [13]. We describe below these stages. 

Fuzzification: is the interpretation of input values (numerics) by the fuzzy system, 
and the obtained output are fuzzy values. 

Definition 1. Let X6l be a linguistic variable and T t {x) a fuzzy set associated 
with a linguistic value 7] . The translation of a numeric value X corresponds to a 
linguistic value associated with a degree of membership, X — > jil T (x) , and this is 
known as Fuzzification. The membership degree jU T (x) represents a value of 
membership to a fuzzy set [14]. 

Inference: is basically like the brain of the system, here the rules of the form 
if-then that describe this behavior are used [2]. For example: 

If X x is A l and ... and X n is A n Then y is B (1) 

where X x ...X are the inputs, A l ...A n ,B are linguistic terms and y is the 

output. 

Defuzzification: consists in obtaining a numeric value to the output. This stage 
basically selects a point that is the most representative of the action to perform [2]. 
There are several methods to calculate the Defuzzification, such as the Center of 
Height (COA), Center of Gravity (COG), etc. The COG is shown in Equation 2. 

y = v" ^ (2) 

where h x is the maximum height of the consequent from rule i to rule N [2]. 
In Figure 1, the fuzzy system information processing is shown. 
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Fig. 1. Fuzzy System 

A genetic algorithm (GA) is a programming technique that mimics biological 
evolution to solve problems, GAs have proven to be a good strategy because of its 
optimal results [3,8]. 

The GA has applications in a wide variety of fields to develop solutions to 
complex problems, including optimization of fuzzy systems, offering them learn- 
ing and adaptation capabilities, these are commonly called genetic fuzzy systems 
or fuzzy system hybrids. 

3 Design and Optimization of the Fuzzy Logic Controller 

The Fuzzy Logic Controller (FLC) is designed for speed control of a DC motor 

and has two inputs and one output. The inputs are error (e (t)) and change of error 

(e '(t)), and the output is the control signal ( y (t)). 

The inputs are calculated as follows 

(3) 
e(t)=r(t)-y(t) 



E'(t)= eft) - e(t-l) 



(4) 



where t is the sampling time. 

The reference signal r(t), is given by: 



r(t) 



t > 
t < 



(5) 



Each input and output of the FIS has three linguistic terms. For the linguistic 
variable error and change of error, the terms are {NB, Z, PB} in this case NB is 
Negative Big, Z is Zero and PB is Positive Big. For the linguistic variable control 
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signal are {BD, H, BI} in this case BD is Big Decrement, H is Hold and BI is Big 
Increment. 

Figure 2 shows the FIS input e(t). 




Error e(t) 



Fig. 2. FIS input eft) 
Figure 3 shows the FIS input change of error e '(t). 









HB I P6 




X - /j\ A 




\ 'j?\ \ / 


04 


\" \Y 




/ \. / \ 








i i i \ i i i i 


-1 


i -• -S -A -2 


4 e 8 10 



Change of Error e"(t) 
Fig. 3. FIS input e'(t ) 

Table 1 shows the rule matrix for the FIS. 
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Figure 4 shows the FIS output (y(t)). 




Fig. 4. FIS output y(t) 
Figure 5 shows the control surface. 
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Fig. 5. Control Surface 
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The FIS is tested in an Incremental fuzzy PD controller [4]. Figure 6 shows the 
FLC architecture used. 
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Fig. 6. Fuzzy PD Incremental 

Figure 7 shows the output of the closed loop system for the previous FLC with 
a reference r(t) of 15 revolutions per minute (rpm). 
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Fig. 7. Velocity for 15 rpm 

As shown in figure 7, the output signal is not the best possible for this problem, 
the next step is to optimize the FLC to get a better response for engine speed 
regulation. 

3.1 Optimization Method 

For the optimization of the FLC using GAs, you must define the chromosome that 
represents the information of the individual, which in this case is related to the uni- 
verse of discourse and the linguistic terms. Figure 8 shows the chromosome of the 
GA. 



•%...« 


»,...*, 


tV 


.a. 


a a . 


..a. 


h (t ...b 2 


a Q . 


..a. 


a,. 


..a. 


k-A 


a„„.tf 3 




Input 1 
















1— 












I 









Fig. 8. GA chromosome 
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Figure 9 shows the triangular and trapezoidal membership functions (MF) that 
are used. 



fi(x) 
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a) b) 

Fig. 9. Parameters of the Membership Functions, a) MF trapezoidal, b) MF triangular 

The GA is of multiobjective type [9], which means that to determine the best 
individual three evaluations are performed: 

a) Minimum overshoot. 

b) Minimum undershoot. 

c) Minimum output steady state error (sse). 

The FLC linguistic terms were optimized with the GA, but the fuzzy rules are not 
changed. The process of the GA is shown in Figure 10. 
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Fig. 10. Optimization GA 
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4 Experimental Results 

To evaluate the ability of the GA, the FLC was simulated for speed control using a 
mathematical model of the plant in Matlab-Simulink [12], as shown in Figure 6. A 
series of experiments was performed that are listed in Table 2. 

Table 2. GA Parameters for different experiments 



No. 


Generations 


Crossover 


Overshoot- 
Undershoot 


SSE 


Time 


1 


50 


0.8 


13.7324 


0.0147 


97.45621 


2 


50 


0.8 


10.0187 


1.4695e-004 


50.65335 


3 


50 


0.8 


14.6503 


0.0103 


102.4405 


4 


50 


0.8 


12.3445 


0.6916 


262.120341 


5 


30 


0.8 


12.9078 


0.0057 


65.638181 


6 


70 


0.7 


10.5311 


1.0487e-004 


79.378554 


7 


100 


0.7 


9.9176 


0.0072 


59.475111 


8 


100 


0.7 


11.1285 


0.0300 


176.874380 


9 


100 


0.9 


10.1559 


0.0013 


143.649806 


10 


50 


0.8 


15.8521 


1.2145e-007 


232.21931 






11 


200 


0.8 


14.7944 


0.0600 


269.598303 


12 


300 


0.9 


9.2882 


9.1190e-004 


129.769955 


13 


350 


0.8 


13.8138 


0.0051 


411.122418 


14 


70 


0.9 


14.2581 


1.5946e-004 


180.825022 



In experiment No. 10 the best FLC was found because this has the lower error 
value. Below are the FIS characteristics for experiments 1, 2 and 10. 

Figure 11 shows how the GA modified the parameters of the membership 
functions for the input e(t). 
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Fig. 11. FIS modified by the GA eft) 
Figure 12 shows the input e 'ft) modified by the GA. 
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Fig. 12. FIS modified by the GA e 'ft) 
Figure 13 shows the output yft) of the FIS. 
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Figure 14 shows the control surface modified by the GA. 
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Fig. 14. Control Surface 
Figure 15 shows the output signal of the FLC PD Incremental for experiment 1. 
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Fig. 15. Velocity 
Figure 16 shows the evolution of the steady state error (sse). 
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Fig. 16. Experiment 1 steady state error 
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Fig. 17. Experiment 1 fitness 
Figure 18 shows the evolution of the best individual. 
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Fig. 18. Experiment 1 best individual 
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Below the results for experiment 2 are shown. Figure 19 shows input e(t), 
modified by the GA. 




Fig. 19. FIS modified by the GA e(t) 
Figure 20 shows the modified input e '(t). 
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Fig. 20. FIS modified by the GA e '(t) 
Figure 21 shows the FIS modified output. 




Fig. 21. FIS modified by the GA y(t) 
Figure 22 shows the control surface modified by the GA. 




Fig. 22. Control Surface 
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Fig. 23. Experiment 2 Velocity 
Figure 24 shows the evolution of the steady state error for experiment 2. 
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Fig. 24. Experiment 2 steady state error 
Figure 25 shows the evolution of the GA Fitness. 
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Figure 26 shows the evolution of the best individual. 
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As shown in Table 2, the best FIS evaluated for the FLC (Fuzzy PD incremental) 
was the one in experiment 10. The results are shown belows. 
Figure 27 shows the input e(t). 
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Fig. 27. FIS input eft) 
Figure 28 shows the input e 'ft). 
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Fig. 28. FIS input e'ft) 
Figure 29 shows FIS output. 
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Fig. 29. FIS output yft) 
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209 




1 "" 



e'<t> 



Fig. 30. Control Surface 
Figure 3 1 shows the speed engine. 
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Fig. 31. Experiment 10 engine speed 
Figure 32 shows the sse evolution for experiment 10. 
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Fig. 32. Experiment 10 sse 
Figure 33 shows the evolution of the GA fitness. 
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Fig. 33. Experiment 10 fitness 
Figure 34 shows the evolution of the best individual. 
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5 Conclusions 



A FIS for the speed control of a DC motor is proposed. Due to bad response of the 
initial closed loop FLC, the next step was to optimize it. Three membership func- 
tions for input and output (Trapezoidal and Triangular) were optimized, but the 
fuzzy rules are not optimized because is thought in the future to implement this 
GA in hardware, so we are thinking of the speed and efficiency of the algorithm. 
To evaluate the controllers, the GA considered three characteristics of the FLC, 
overshoot, undershoot and steady state error. 

Each FIS is simulated in an Fuzzy incremental PD for speed control of the DC 
motor. 

The best FLC was obtained in 50 generations with 80% crossover, with a result 
of 15.85 of overshoot and undershoot, steady state error of 1.2145e-007, in a time 
of 232.21 seconds with a speed of 15 rpm. 

Matlab-Simulink was used to perform the simulations, but in the future a hard- 
ware implementation will be done. 
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Abstract. In this paper we present a design of a fuzzy system for the longitudinal 
control of an F-14 airplane. The longitudinal control is carried out only by control- 
ling the elevators of the airplane. To carry out such monitoring it is necessary to use 
the stick, the rate of elevation and the angle of attack. These 3 variables are input 
into the fuzzy inference system, which is of Mamdani type, and we obtain as output 
the value of the elevators. After designing the fuzzy inference system we turn to the 
simulation stage. Simulation results of the longitudinal control are obtained using a 
plant in Simulink and those results are compared against the PID controller. 

1 Introduction 

This paper focuses on the field of fuzzy logic by taking into account also the con- 
trol area. Both areas can work together to solve various problems, in this case the 
longitudinal flight control. Over time the planes have evolved and at the same time 
improving their techniques for controlling their flight and avoid accidents as much 
as possible. For this reason, we are considering in this paper the implementation of 
a system that controls the horizontal position of the aircraft. We created the fuzzy 
system to perform longitudinal control of the aircraft and then we used a simula- 
tion tool to test the fuzzy controller under noisy conditions. We designed the fuzzy 
controller with the purpose of maintaining the stability in horizontal flight of the 
aircraft by controlling only the movement of the elevators. 

2 Background and Basic Concepts 

In this section we present some background of aviation and how it is changing 
over time and also some basic concepts for understanding this work. 

2.1 Aviation History 

In 1680 Giovanni Borello studied the application of the flight muscles of birds 
and he came to the conclusion that a person could never fly by himself. In the 
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nineteenth century Sir George Cayley established the modern concept of a plane 
and he applied the idea of a fixed wing capable of lifting the airplane [1]. Cayley 
established the aerodynamics foundation. In 1804 Cayley constructed a plane and 
realized his first unmanned flights. Cayley worked during the 5 decades following 
in his prototype and in 1853 an assistant of Cayley performed a flight of short du- 
ration in the plane that was in England [8]. In 1856 Jean-Marie Le Bris realized 
his first sky gliding flight higher than his point of departure thanks to the glider 
"L'Albatros Artificiel" , which for takeoff was drawn by horses in the beach. 
Etienne Lenoir who was a belgian inventor has the Credit for the first internal 
combustion engine in 1859, he developed only one horsepower and it was ineffi- 
cient, but that resulted in billions of internal combustion engines have been built 
since then. Many people were part of that inventions and development of the air- 
craft up to date is in continue growth and development for the benefit of humanity. 
If we want to speak about a war plane we can say something about the F-14 air- 
plane because it is the airplane that has been used for this task. The development 
of the F-14 came about because of the need to improve the aging American Fight- 
ers in use in the early days of the Cold War [11]. Some form of upgrade was re- 
quired if the United States was to maintain its competitive edge over the Soviet 
Union. As a result, the U.S. Navy and the U.S. Air Force teamed up to explore the 
possibility of developing a fighter plane that would, as far as possible, be appro- 
priate for the needs of both forces [9]. The practical flight aircraft was only possi- 
ble once there was the energy source capable of driving. Since then, they have 
created many types of engines, more powerful, small and reliable, and used as the 
required application [10] [4] [3]. 

2.2 Control Surfaces 

Every aircraft has control surfaces or other means which are used to generate the 
forces and moments required to produce the accelerations, which causes the air- 
craft to be steered along its three-dimensional flight path to its specified destina- 
tion. A conventional aircraft is represented in Figure 1. It is shown with the 
usual control surfaces, namely elevator, ailerons, and rudder. Such conventional 
aircrafts have a fourth control, the change in thrust, which can be obtained from 
the engines. Many modern aircraft, particularly combat aircraft, have considera- 
bly more control surfaces, which produce additional control forces or moments. 
Some of these additional surfaces and motivators include horizontal and 
vertical canards, spoilers, variable cambered wings, reaction jets, differentially 
operating horizontal tails and movable fins. One characteristic of flight control 
is that the required motion often needs a number of control surfaces to be used 
simultaneously [16]. 
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Aileron? 



Elevator 




Fig. 1. Conventional Aircraft [16] 

2.3 Flight Maneuvers 

Any aircraft is capable of 3 possible rotations about 3 mutually perpendicular axes 
whose intersection point is above the center of gravity of the airplane (see Fig. 2). 



.Roll 




Pitch 

Fig. 2. Possible rotations around the 3 axes 



As we can appreciate from Fig. 2, Pitch is the type of rotation that is done on the 
Y axis of the plane and it can be represented as an ascent or descent of its nose. 
Roll is a movement around the longitudinal axis of the plane (X axis) and be seen 
as a lateral inclination thereof. Yaw is a rotation about the vertical axis (Z) of the 
plane, and produces a change in the horizontal direction of flight. Each flight ma- 
neuver is assigned to a particular type of control: side control, direction control 
and longitudinal control. The first one, controls the ailerons and if one of these is 
flexed downward, it results in an increased lift on the wing for causing the rise of 
the same, while if the wing is bent upward, the wing produces a sustainability de- 
crease, encouraging the decline of the plane. The second one controls the rudder 
and this is done using the pedals. For a yaw movement to the right, the driver 
presses the right pedal, thus turning the rudder surface to the right. Finally, the 
longitudinal control is responsible for controlling the elevators and when we talk 
about longitudinal control we make reference to the pitching motion generated by 
the aircraft. All controlled flight consists of either one, or a combination or more 
than one, of these basic maneuvers. It is important to know what can happen if 
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you have to control the flight of an airplane because, when back pressure is ap- 
plied to the elevator control, the airplane's nose rises in relation to the pilot. When 
forward pressure is applied to the elevator control, the airplane's nose lowers in 
relation to the pilot. When right pressure is applied to the aileron control, the air- 
plane's right wing lowers in relation to the pilot. When left pressure is applied to 
the aileron control, the airplane's left wing lowers in relation to the pilot. When 
pressure is applied to the right rudder pedal, the airplane's nose moves (yaws) to 
the right in relation to the pilot. When pressure is applied to the left rudder pedal, 
the airplane's nose moves (yaws) to the left in relation to the pilot [6] [19] [12]. 

2.4 Forces Involved in Flight 

Four forces act upon an aircraft in relation to straight-and level, unaccelerated 
flight. These forces are thrust, lift, weight, and drag (see Fig. 3). 




Fig. 3. The four forces [7] 

Thrust is the forward force produced by the powerplant/propeller. It opposes or 
overcomes the drag force. As a general rule, it is said to act parallel to the longitu- 
dinal axis. This is not always the case as explained later. Drag is a backward, re- 
tarding force, and is caused by disruption of airflow by the wing, fuselage, and 
other protruding objects. Drag opposes thrust, and acts rearward parallel to the 
relative wind. Weight is the combined load of the airplane itself, the crew, the 
fuel, and the cargo or baggage. Weight pulls the airplane downward because of the 
force of gravity. It opposes lift, and acts vertically downward through the air- 
plane's center of gravity (CG). 

The lift opposes the downward force of weight, is produced by the dynamic ef- 
fect of the air acting on the wing, and acts perpendicular to the flightpath through 
the wing's center of lift [7]. Each airplane has an standard mathematical model 
and usually linear models are applied whose parameters are obtained from wind 
tunnel tests results using a scale model [2] [17]. These models are evaluated 
in each operation point with the information obtained in the tests. There are more 
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sophisticated nonlinear models that approximate the aerodynamic coefficients by 
truncated Taylor series [18]. 

3 Related Work 

The field of aeronautics has been studied extensively over the years with the aim 
of further improving the techniques of flying as much as possible to avoid acci- 
dents. There has been several research works in this area although each author 
takes their own approach to carry out the investigation. 

H. Enhelen [5] mentions in his work an intelligent and autonomous flight control 
system for an atmospheric re-entry vehicle that is investigated, based on fuzzy logic 
control and aerodynamic inversion computation, also a common PD-Mamdani fuzzy 
logic controller is designed for all the five re-entry flight regions characterized by 
different actuator configurations. B. Kadmiry [13] addressed the design of a fuzzy 
flight controller that achieves stable and robust "aggressive" maneuverability for an 
unmanned helicopter. The proposed fuzzy flight controller consists of a combination 
of a fuzzy gain scheduler and a linguistic (Mamdani-type) controller. The fuzzy gain 
scheduler is used for stable and robust altitude, roll, pitch, and yaw control. The lin- 
guistic controller is used to compute the inputs to the fuzzy gain scheduler, i.e., 
desired values for roll, pitch, and yaw at given desired altitude and horizontal veloci- 
ties. The flight controller is obtained and tested in simulation using a realistic 
nonlinear MIMO model of a real unmanned helicopter platform, the APID-MK3. 
Edgar Sanchez and Hector Becerra [22] describe different flight controllers for an 
X-Cell mini-helicopter. They divided their work in 2 parts. The first scheme con- 
sists of a conventional SISO PID controllers for z-position and roll, pitch and yaw 
angles. In the second scheme, two of the previous PID controllers are used for roll 
and pitch, and a linear regulator is added to control altitude and yaw angle. 

D. J. Walker [25] in his work proposes a separated longitudinal and lateral con- 
trollers, each designed using H-infinity optimization. Rachman Endri, et al. [20], 
in their work used non-linear simulation of controller for automatic flight control 
system to complete control design process. After, they used the linear approach in 
the design and simulation of the controller (control laws) and they simulated the 
controller on the non-linear real aircraft model. 

Jacob Reiner, et al. [21] in their work used an angle-of-attack command system 
for longitudinal control of a high performance aircraft. Keviczky Tamas and Balas 
Gary [14] in their work used a comparison between receding horizon control for 
the longitudinal axis control of an f-16 aircraft. As mentioned above, there are 
several works on aviation for their importance regarding the control, flight control 
range from small, trajectories, using methods such as fuzzy logic and inference 
systems [24][23][15][12]. 



4 Problem Description 

The purpose of this work was to develop a fuzzy system for automatic control to 
maintain the aircraft in horizontal flight. The goal was to create the fuzzy system to 
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perform longitudinal control of the aircraft and also to use a simulation facility to 
test the fuzzy controller with noise. The main goal was to achieve the stability in 
horizontal flight of the aircraft by controlling only the movement of the elevators. 



5 PID Control and Fuzzy System for Longitudinal Control 

If we want to use the longitudinal control we need to use 3 elements, which are: 

a) Stick: The lever of the pilot. Moving the control stick backwards (toward the 
pilot) will rise the nose of the plane, and if push forward there is a lowering 
of the nose of the plane. 

b) Angle of attack(a). 

c) Rate of elevation (q): The speed at which an aircraft climbs. 

We need the elements above mentioned to perform elevator control. The compari- 
son of the control system was carried out first using the PID plant of Matlab (see 
Fig. 4) for longitudinal control and then we compared the results obtained with the 
same plant but using a fuzzy controller that was created and eventually carried out 
the simulation of the 2 controllers and compare the results of fuzzy control with 
respect to PID control. 




Fig. 4. Simulation Plant with PID Controller 



The fuzzy system has 3 inputs (stick, angle of attack and rate of elevation) and 
1 output (elevators) (see Fig. 5). 
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Fig. 5. Fuzzy System 



The fuzzy system that we used as a controller has 3 membership functions for 
each of the inputs and 3 membership functions of the output. We worked with dif- 
ferent types of membership functions, such as the Gaussian, Bell, Trapezoidal and 
Triangular. The rules that we used for the fuzzy system are shown in Fig. 6. 



I . If (Stick & bap) sti (alpria(rad) is bajo) and (oXradteec) is toajo) then (Elevator Is bajo) (1 ) 

2 If (Stick is bap) and (atj>ha{fad) is medio) and (oXrad/sec) is medio) then (Elevator is bajo) (1 ) 

3 If (Stick is bajo) and (alpha^rad) is alto) and (q(radjsec) is alto) then (Elevator is medio) (1 } 

4 If (Stick is ba;o) and (atoha(fad) is bsto) and (ctfradfcec) is medio) ihen (Elevator is bajo) (1 ) 

5. If (Stick is medio) and (alpha(rad) is bajo) and (q(radrsec) is bajo) then (Elevator is bafo) (1 ) 

6. If (Stick is bajo) and (atpha{r ad) is medio) and (q(ractfsec) is bajo) than (Elevator is bajo) (1) 
7- if (Stick is bajo) and (afciha(rad) is medio} and (q(radfeec) is medio)then (Elevator is bajo) (1 ) 

8. If (Stick is bajo) and (ak*ha(radj rs medio) and (q(rad. r sec) is afto) then (Elevator is medio) (1 ) 

9. If (Stick is bajo) and (alpha(rad) is alto) and (q(radfsec) is bajo) then (Elevator is bajo) (1 ) 
ID. If (Stick is bajo) and (alphas ad.) is alto) and (q(radrsec) is medio) then (Elevator is bajo) (l J 

II . If (Stick is bajo) and (alphafrad) is afto) and (okradteec) is alto) then (Elevator is bajo) (1 ) 

1 2 . If (Stick is medio) and (alpha(rad) is bajo) and (q(rad/sec) is bajo) then (Elevator is bajo) ( 1 ) 

1 3. If (Stick is medio) and (alpha(rad) is bajo) and (q(rad/sec) is medio) then (Elevator is bajo) (1 ) 

1 4 . If (Stick is medio) and (alphafrad) is bajo-) and (q(radfeec ) is alto) then (Elevator is bap) (1 ) 

15. If (Stick is medio) and (alpha(rad) is medio) and (e^rad/sec) is bajo) then (Elevator is bajo) (1 ) 

16. If (Stick is medio) and (alpha(rad) is medio) and (q(rad/sec) is medio) then (Elevator is medio) (1 ) 

1 7. If (Stick Is medio) and (alpha(rad) is medio) and (q(rad/sec) is alto) then (Elevator is medio) ( i ) 

18. If (Stick is medio) and (alpha(rad) is afto} and (q(radfeec) is bajo) then (Elevator is bajo) (1) 

1 9 . If (Stick is medio) and (aipha(rad) is alto) and (q(radfeec) Is medio) then (Elevator is medio) (1 ) 

20. If (Stick is medio) and (alpha(rad) is alto} and (q(rad/sec) is alto) then (Elevator is medio) (1 ) 

21 . If (Stick is afto) and (atpha(rad} is bap) and (q.( rad-sec j is bajo) then (Elevator is bajo) (1 } 

22. If (Stick is afto) and (sfcjhefrad) ' s tja i°) and (qfradtfsec) is medio) then (Elevator is medio) (1 ) 

23. If (Stick is afto) and (alpha(rad} is bap) and (q(rad/sec} is alio) then (Elevator is medio) (1) 

24. If (Stick is afto) and (a*5ha<rad) is medio) and (q(rad/see) is bajo) then (Elevator is medio) (1 ) 

25. If (Stick is afto) and (aftaha(rad} is medio} and (q(radfcec) is medio) then (Elevator is medio) (1 ) 

26. If (Stick is afto) and (aipha(rad) is medio} and (ckrad/sec) is afto) then (Elevator rs medio) (1 ) 

27. If (Stick is afto) and (alpha(rad} is afto) and (q<rad/sec) is ato) then (Elevator is afto) (1 ) 



Fig. 6. Fuzzy System Rules 



6 Simulation Results 



In this section we present the results obtained when performing the tests using the 
simulation plant with the PID and Fuzzy controllers. It also presents the results ob- 
tained by optimizing the fuzzy system. 
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6.1 Results with PID Controller 

The first simulation was performed with the PID controller and we obtained the ele- 
vators behavior shown in Fig. 7. We obtained an average elevator angle of 0.2967. 



| Viewer: Scope2 
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Fig. 7. Elevators Behavior 

6.2 Results with the Fuzzy Controller 

Once the simulation results with the PID Controller were obtained, we proceded with 
our Fuzzy Controller (see Fig.8) using the fuzzy system that was created previously. 
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Fig. 8. Simulation Plant with Fuzzy Controller 
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The simulation was carried out with different types of membership functions 
and the results that were obtained as shown in Table 1 . 

Table 1. Results for simulation plant with Fuzzy Controller 



Membership 
functions 


Trapezoidal 


Triangular 


Gauss 


Bell 


Errors 

comparing 

with PID 


0.1094 


0.1131 


0.1425 


0.1222 


Comments 


Fast 
Simulation 


Less Fast 
Simulation 


Slow 
simulation in 
comparison 
with previous 


Slow simu- 
lation in 
comparison 

with previous 



Having obtained the above results, we used a genetic algorithm to optimize the 
membership functions of the fuzzy system and after implementing the genetic 
algorithm we obtained the results shown in Table 2. 

Table 2. Results for simulation plant with Fuzzy Controller and Genetic Algorithm 



Genetic Algorithm 


Error with 
respect to PID 


Using Trapezoidal member- 
ship functions 


0.0531 


Using Gauss membership 
functions 


0.084236395 


Using Bell membership 
functions 


0.0554 


Using Triangular 
membership functions 


0.0261 
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Given the above results we can see that better results were obtained using ge- 
netic algorithms and in particular the best result was using Membership functions 
of triangular type. The best fuzzy system that the genetic algorithm produced is 
shown in fig. 9. 




Fig. 9. The best fuzzy system 



7 Conclusions 



Based on the results obtained with the proposed method we can say that to achieve 
control of the plane in the Pitch axis, the fuzzy controller is a good option to have 
a good control of the elevators and maintain the aircraft on the Pitch axis as 
required. 

The result that we obtained was better when we used the genetic algorithm op- 
timization with a triangular type membership function. It is important to consider 
different membership function because you can see the differences and you can 
choose the best result. Future work consists in adding noise to test the fuzzy con- 
troller under disturbances. 
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Abstract. This paper describes an evolutionary algorithm approach for the opti- 
mization of a reactive controller applied to a mobile robot. The algorithm will op- 
timize the Fuzzy Inference System and the position and number of the sensors on 
the robot, while trying to use the minimum power possible. 

1 Introduction 

Robots are being more commonly used in many areas of research and a reason for 
this is that they are becoming more accessible economically for researchers. In 
this paper we consider the optimization of a fuzzy controller; that gives the ability 
of reaction to the robot. This may be too general so let's limit what in this paper 
will be described as ability of reaction, this is applied in the navigation concept, so 
what this means is that when the robot is moving and at some point of its journey 
it encounters an unexpected obstacle it will react to this stimulation avoiding and 
continuing on its path. The trajectory and path following are considered indepen- 
dent parts and are not consider on this paper [19]. 

There are many traditional techniques available to use in control, such as PD, 
PID and many more but we took a different approach in the Control of the robot, 
using an area of soft computing which is fuzzy logic that was introduced by Zadeh 
[ 1 ] . Later this idea was applied in the area of control by Mandami [2] where the 
concept of FLC (Fuzzy Logic Controller) originated. It's also important to men- 
tion that this is not the only area were the fuzzy concepts are applied but it's where 
the most work has been done, and were many people have contributed important 
ideas and methods like Takagi and Sugeno [2], 

This paper is organized as follows: In section 2 we describe the mobile robot used 
in these experiments, section 3 describes the development of the evolutionary method. 
Section 4 shows the simulation results. Finally section 5 shows the Conclusions. 

2 Mobile Robot 

In this section we describe the particular mobile robot considered in this work. 
The robot is based on the description of the the Simulation toolbox for mobile 
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robots[22] assumes wheeled mobile robot consisting of one or two conventional, 
steered, unactuated and not-sensed wheels and two conventional, actuated, and 
sensed wheels (conventional wheel chair model). This type of chassis provides 
two DOF (degrees of freedom) locomotion by two actuated conventional non- 
steered wheels and one or two unactuated steered wheels. The Robot has two de- 
grees of freedom (DOFs): y-translation and either x-translation or z-rotation [22], 
Figure 1 shows the robots configuration, it will have 2 independent motors located 
on each side of the robot and one castor wheel for support located at the form of 
the robot. 




Fig. 1. Kinematic coordinate system assignments [22] 
The kinematic equations of the mobile robot are as follows: 



X 



R 



(1) 



Equation 1 represents the sensed forward velocity solution [22]. 
Equation 2 represents the Actuated Inverse Velocity Solution [22]. 



1 



Where under the Metric system. 









>> 



(2) 



17 1 7 

*B ' 'B Translational velocities [ — ], 

rad 
(x)g Robot z-rotational velocity [ ], 



rad 



COyy , OJyy Wheel rotational velocities [ ], 

R Actuated wheel radius [m], 

l a , lb Distances of wheels from robot's axes [m] 
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3 Evolutionary Method Description 

In this section we will cover the Genetic Algorithm applied to our problem of 
finding the best fuzzy reactive controller for a mobile robot. 

The purpose of using a evolutionary method, is to obtained the best reactive 
control possible, for the robot, but also taking into consideration other desirable 
characteristics on the robot that we want to improve making this a multi objective 
[17] problem, for this we will take advantage of the HGA (Hierarchy Genetic Al- 
gorithm) intrinsic characteristic to solve multi objective problems, now let us state 
the main goal of our HGA. Figure 2 shows the genetic process. 

The main goal is to optimize the Reactive Control taken in to consideration the 
following: 

Fine tune the Fuzzy Memberships 
Optimize the FIS if then fuzzy rules 
The mobile robot Power Usage 



Initial 
Population 



Selection 



Genetic 
Operations 



Evaluate the new 
population 



Crossover 



Mutation 



Optimization Considerations 

Reactive Control 
Power usage 



EC 
□p 



□ 



/ 
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.\ 
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Fig. 2. Genetic Algorithm process 



In Fig. 2, we show the global cycle process of the GA, under the Evaluation of 
the each individual, is where we are going, to measure the goodness of each of the 
FIS (Fuzzy Inference System) represent by each Individual chromosome, in our 
test area, that will take place in a unknown environment (Maze [19]) to the robot 
where the robot's objective will be find the exit, avoiding hitting the walls and any 
other obstacle present. 

Our criteria to measure the FIS global performance will take into consideration 
the following: 

o Number of collisions 

o Average time between hits 

o Cover Distance 
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o Time use to cover distance 
o Battery life 

All of these variables are the inputs of the Evaluation F1S that we will use to ob- 
tain the fitness value of each chromosome (shown in Figure 3). 




Fig. 3. Fitness FIS 

The FIS that we optimize is a Mamdani type fuzzy system, consisting of 3 in- 
puts that are the distances obtain by the robots sensors describe on section 2, and 2 
outputs that control de velocity of the servo motors on the robot, all this informa- 
tion is encoded on each chromosome. 

The chromosome architecture, shown on Fig. 4, where we have encoded the 
membership functions type and parameters, we have set a maximum number of 5 
membership functions for each of the outputs and input variables. 
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Fig. 4. Chromosome Architecture 
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4 Simulation Results 

We have worked on the reactive control for a mobile robot before, where we 
use a particular maze problem to test the effectiveness of each of the reactive con- 
trols, and we did not use any optimization strategy to fine tune the controllers as it 
was a manually process, and from that experience is that we decided to apply GA 
to this problem. On Figure 5 and Table lwe show the results obtained in the 
experiments. 





Fig. 5. Robot trajectory control by 10 Rules FIS (Left) and 27 FIS Rules (Right) 



Figure 6 shows the new optimized results with the GA. 




Fig. 6. Previous Result vs. New Controllers Experiments 
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Table 1. Reactive Control Non Optimized Results 



Experiment 


LAS 


RAS 


Time 


FE 


27 Rules FIS 


9 


38.26 


38.46 


60.34 


Yes 


10 


40.42 


40.64 


59.50 


Yes 


12 


40.06 


40.12 


60.70 


Yes 


Average 


39.58 


39.74 


60.18 




Standard Deviation 


1.16 


1.14 


0.62 




10 Rules FIS 


1 


31.51 


47.43 


55.75 


Yes 


2 


35.55 


54.33 


51.80 


Yes 


3 


36.66 


53.68 


55.40 


Yes 


4 


36.66 


53.68 


54.10 


Yes 


5 


36.91 


54.18 


49.95 


Yes 


6 


35.44 


52.51 


46.65 


Yes 


7 


36.14 


51.66 


49.85 


Yes 


8 


33.48 


49.58 


51.76 


Yes 


9 


37.61 


51.51 


55.15 


Yes 


Average 


35.55 


52.06 


52.27 




Standard Deviation 


1.92 


2.32 


3.10 





*LAS=Left Motor Average Speed 
*RAS= Right Motor Average Speed 
*FE= Found Exit 



5 Conclusions 



We are yet to conclude these experiments but we expect with the help of the HGA 
to improve the overall performance of the mobile robot, and improve the results 
obtained previously. 

We will test our best reactive controller obtained with the HGA under the same 
maze problem this in order to establish a comparison between the controllers. 
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Abstract. For achieving a good and friendlier interaction between man and robot, 
this paper used various techniques of psychology, such as the ODD protocol and 
Big Five. Loyalty is the fact that helps knowledge generation, expression, and 
learning in humans. These models of personality are verified, in real situations, 
this, by its application in the situation where humans and robots interact in view of 
the stage. 

1 Introduction 

Robotics has evolved considerably since its inception, examples of the use of them 
seriously, in car manufacturing. In recent times the use of robots in the field of 
households has grown, thanks to a decline in cost and size. It is mentioned that not 
too distant future, robots will provide services to individuals, to our workplaces 
and in our homes [1]. There are currently supporting some domestic chores (vac- 
uum cleaners). 

It would be wrong to think or believe that users learn about sensors, actuators 
and control architectures. In support of this, you have to convey a mental model 
that helps users make sense of the robot's behavior and understanding that are nec- 
essary steps on their side. The proper design of the interaction between humans 
and robots will be crucial for the understanding and acceptance of new robotic 
products [2], 

Some researchers [16, 17] have reported that in the human domain Affect, 
long-lasting affective relationship as well as event-based temporal aspects such as 
emotion significantly influences on the determination of human beings social be- 
haviors. Based on the above claims, we designed the mechanism of a robot loyalty 
formation by computationally modeling the affective relationship between a robot 
and people. Some social theories such as good-feeling generation principle, bal- 
ance social theory, social exchange theory are applied to our model development. 
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2 Preliminaries 

2.1 Agents 

Let's first deal with the notion of intelligent agents. These are generally defined as 
"software entities", which assist their users and act on their behalf. Agents make 
your life easier, save you time, and simplify the growing complexity of the world, 
acting like a personal secretary, assistant, or personal advisor, who learns what 
you like and can anticipate what you want or need. The principle of such intelli- 
gence is practically the same of human intelligence. Through a relation of collabo- 
ration-interaction with its user, the agent is able to learn from himself, from the 
external world and even from other agents, and consequently act autonomously 
from the user, adapt itself to the multiplicity of experiences and change its behav- 
ior according to them. The possibilities offered for humans, in a world whose 
complexity is growing exponentially, are enormous [20,21]. 

We need to be careful to distinguish between rationality and omniscience. An 
omniscient agent knows the actual outcome of its actions, and can act accordingly; 
but omniscience is impossible in reality. Crossing the street was rational because 
most of the time the crossing would be successful, and there was no way I could 
have foreseen the falling door. Note that another agent that was equipped with ra- 
dar for detecting falling doors or a steel cage strong enough to repel them would 
be more successful, but it would not be any more rational [1 1]. 

2.2 Agent Based Simulations 

As these non-linear, adaptive interactions are mostly too complex to be captured 
by analytical expressions, computer simulations are most often used. The basic 
idea of such simulations is to specify the rules of behavior of individual entities, as 
well as the rules of their interaction, to simulate a multitude of the individual enti- 
ties using a computer model, and to explore the consequences of the specified in- 
dividual-level rules on the level of population as a whole, using results of simula- 
tion runs. As the simulated entities are usually called agents, the simulations of 
their behavior and interactions are known as agent-based simulations. The proper- 
ties of individual agents describing their behavior and interactions are known as 
elementary properties, and the properties emerging on the higher, collective level 
are known as emergent properties [18]. 

2.3 Basic Properties of Agent-Based Models 

What makes agent-based models particularly appealing and interesting is that con- 
sequences on the collective level are often neither obvious, nor expectable, even in 
many cases when the assumptions on individual agent properties are very simple. 
Namely, the capability of generating complex and intriguing emergent properties 
arises. 

Not so much from the in-built rules of individual agent behavior, as from 
the complexity of the network of interactions among the agents. Precisely this 
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multitude of agents, as well as the multitude and complexity of their interactions, 
are the main reasons why in most cases formal mathematical deduction of results 
of an agent-based model is not possible. 

This is also the reason why the issues of complexity remained relatively under- 
explored until recently. Namely, as scientists regularly decide to pay attention to 
"problems defined by the conceptual and instrumental techniques already at hand" 
[4], "some facts are pushed to the periphery of scientific investigation, 
either because they are thought not to be relevant, or because their study would 
demand unavailable techniques". Accordingly, only after recent advances in the 
development of computational technology have enabled massive simulation ex- 
periments, the issues of emergent complexity came closer to the focus of scientific 
research. 

Besides the above mentioned modeling of bottom-up effects, i. e. the effects 
originating at the individual level and influencing the collective one, more com- 
plex agent-based models are also capable of modeling top-down effects, arising at 
the collective level and influencing the level of individual agents [22,23]. 

3 Personalities 

Several experiments have demonstrated that users are naturally biased to ascribe 
certain personality traits to machines, to PCs, and other types of media [25]. For a 
product designer, it is therefore important to understand how these perceptions of 
personality influence the interaction and how a coherent personality can be util- 
ized in a product. Personality is an extensively studied concept in psychology. As 
McAdams [5] point out, there is no "comprehensive and integrative framework for 
understanding the whole person". Carver [14] gives an impression of the diversity 
of research on personality. They present an overview of personality theories cate- 
gorized along seven perspectives, including the biological, psychoanalytic, neo- 
analytic, learning, cognitive self-regulation, phenomenological, and dispositional 
perspective. In outlines, these theories agree on the general characteristics of per- 
sonality, amongst others that personality is tied to the physical body; helps to de- 
termine how the person relates to the world; shows up in patterns (recurrent and 
consistent); and is displayed in many ways (in behavior, thoughts, and feelings). 
As our work concentrates on the expression of personality as a pattern of traits, 
personality research on dispositional traits was considered most relevant. This dis- 
positional perspective is based on the idea that people have relatively stable quali- 
ties (or traits) that are displayed in diverse settings. Dryer [6] stresses three focus 
points to maintain the coherence of the characters personality: (1) cohesiveness of 
behavior (2) temporal stability (3) cross-situation generality. A combination of 
several trait theories that focused on labeling and measuring people's personality 
based on the terms of everyday language (e.g. helpful, assertive, impulsive, etc.) 
led to an emerging consensus on the dimensions of personality in the form of the 
Big-Five theory. The Big-Five is currently the theory that is supported by most 
empirical evidence and it is generally accepted [5]. It describes personality in five 
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dimensions: extraversion, agreeableness, conscientiousness, neuroticism, and 
openness to new experiences. Recent studies have used personality theories such 
as the Big Five to assess people's perceptions of robot personality (e.g. [13, 30]). 
However, the Big-Five theory of personality can also be used as a framework to 
describe and design the personality of products, and in particular of robots. Nor- 
man [18] describes personality as: 'a form of conceptual model, for it channels 
behavior, beliefs, and intentions into a cohesive, consistent set of behaviors. 
Although he admits this is an oversimplification of the complex field of human 
personality, the statement indicates that deliberately equipping a robot with a per- 
sonality, it helps to provide people with good models and a good understanding of 
the behavior. 

Table 1. Five-factor model: dimensions and facets 



Personality 

dimension 


Personality facets 


Extraversion 


warmth, gregariousness. assertiveness. 
activity, excitement-seeking, positive 
emotion 


Agreeableness 


trust straightforwardness, alfiuism. 
compliance. modesty, tender- mindedness 


Conscientiousness 


competence, order, dutifuhiess. achievement, 
striving, self-discipline, deliberation 


Openness 


fantasy, aesthetics, feelings, actions, ideas. 
values 


Neuroticism 


anxiety, angry hostility, depression, self- 
coQsciousness. impulsiveness, vulnerability 



4 Big Five Personality Factors 

Why do we study personality? 

The NEO that you have just completed looks at 5 personality traits, known as the 
Big Five. We will briefly look at what traits are, how these personality factors 
were determined, what the traits mean, what the Big Five predict about our behav- 
ior, and how these factors might relate to motivation. 

What are traits? 



Traits are consistent patterns of thoughts, feelings, or actions that distinguish peo- 
ple from one another. Traits are basis tendencies that remain stable across the life 
span, but characteristic behavior can change considerably through adaptive proc- 
esses. A trait is an internal characteristic that corresponds to an extreme position 
on a behavioral dimension. 
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There have been different theoretical perspectives in the field of personality psy- 
chology over the years including human motivation, the whole person, and individ- 
ual differences. The Big Five falls under the perspective of individual differences. 

How were these personality factors determined? 

The Big Five represents taxonomy (classification system) of traits that some per- 
sonality psychologists suggest capture the essence of individual differences in per- 
sonality. These traits were arrived at through factor analysis studies. Factor is a 
technique generally done with the use of computers to determine meaningful rela- 
tionships and patterns in behavioral data. You begin with a large number of behav- 
ioral variables. The computer finds relationships or natural connections where 
variables are maximally correlated with one another and minimally correlated with 
other variables and then groups the data accordingly. After this process has been 
done many times a pattern appears of relationships or certain factors that capture 
the essence of all of the data. Such a process was used to determine the Big Five 
Personality factors. Many researchers tested factors other than the Big Five and 
found the Big Five to be the only consistently reliable factors [9, 22, and 26]. 

Strict trait personality psychologists go so far as to say our behavior is really 
determined by these internal traits, giving the situation a small role in determining 
behavior. In other words, these traits lead to an individual acting a certain way in a 
given situation. 

Allport, Norman and Cattell were influential in formulating this taxonomy 
which was later refined. Al port compiled a list of 4500 traits. Cattell reduced this 
list to 35 traits. Others continued to analyze these factors and found congruence 
with self- ratings, ratings by peers and ratings by psychological staff that eventu- 
ally became the Big Five factors. 

The Big Five factors are: 

I - extraversion vs introversion 

II - agreeableness vs antagonism 

III - conscientiousness vs undirectedness 

IV - neuroticism vs emotional stability 

V - openness to experience vs not open to experience 

There was a need for an integrative framework for measuring these factors. The 
NEO Personality Inventory was created by Costa and McCrae and originally 
measured only neuroticism, extraversion and openness. The other factors were 
added later. There are other measures of the Big Five, such as the BFI (Big Five 
Inventory) and the TDA (Traits Descriptive Adjectives). The NEO has the highest 
validity of the Big Five measurement devices [21]. 

What do the five traits mean? 

Keep in mind that the traits fall on a continuum and this overhead shows charac- 
teristics associated with each of the traits. Looking at these characteristics we can 
formulate what each of the traits mean. 
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• Openness - (inventive / curious vs. cautious / conservative). Appreciation 
for art, emotion, adventure, unusual ideas, curiosity, and variety of 
experience. 

• Conscientiousness - (efficient / organized vs. easy-going /careless). A 
tendency to show self-discipline, act dutifully, and aim for achievement; 
planned rather than spontaneous behavior. 

• Extroversion - (outgoing / energetic vs. shy / withdrawn). Energy, posi- 
tive emotions, urgency, and the tendency to seek stimulation in the 
company of others. 

• Agreeableness - (friendly / compassionate vs. competitive / outspoken). 
A tendency to be compassionate and cooperative rather than suspicious 
and antagonistic towards others. 

• Neuroticism - (sensitive / nervous vs. secure /confident). A tendency 
to experience unpleasant emotions easily, such as anger, anxiety, 
depression, or vulnerability [9, 22, 26]. 

5 The ODD Protocol 

The basic idea of the protocol is always to structure the information about an IBM 
in the same sequence (Fig. 1). This sequence consists of seven elements that can 
be grouped in three blocks: Overview, Design concepts, and Details (as a mne- 
monic, this sequence can be referred to as the ODD sequence). The overview con- 
sists of three elements (purpose, State variables and scales, process overview and 
scheduling), which provide an overview of the overall purpose and structure of the 
model. Readers very quickly can get an idea of the model's focus, resolution and 
complexity. After reading the overview it should be possible to write, in an object 
oriented programming language, the skeleton of a program that implements the 
IBM described. This skeleton includes the declaration of all objects (classes) de- 
scribing the models entities (different types of individuals or environments) and 
the scheduling of the model's processes. The block or element "Design concepts" 
does not describe the model itself, but rather describes the general concepts under- 
lying the design of the model. The purpose of this element of the protocol is to 
link model design to general concepts identified in the field of Complex Adaptive 
Systems [29]. These concepts include questions about emergence, the type interac- 
tions among individuals, whether individuals consider predictions about future 
conditions, or why and how stochastic city is considered. By referring to such 
general design concepts, each individual-based and agent-based model is inte- 
grated into the larger framework of the science of Complex Adaptive Systems. 
The third part of ODD, Details, includes three elements (initialization, input, sub- 
models) that present the details that were omitted in the overview. In particular, 
the submodels implementing the model's processes are described in detail. All in- 
formation required to completely re-implement the model and run the baseline 
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simulations should be provided here The logic behind the ODD sequence is: con- 
text and general information is provided first (Overview), followed by more stra- 
tegic considerations (Design concepts), and finally more technical details (De- 
tails). We can help readers understand our IBMs by always using this structure: a 
standard protocol that provides the information in an order that allows the reader 
to easily build on their previous understanding. Below, the seven elements of 
ODD are described. 

We conclude that what we badly need is a standard protocol for describing 
IBMs which combines two elements: (1) a general structure for describing IBMs, 
thereby making a model's description independent of its specific structure, pur- 
pose and form of implementation [24] and (2) the language of mathematics, 
thereby clearly separating verbal considerations from a mathematical description 
of the equations, rules, and schedules that constitute the model. Such a protocol 
could, once widely used, guide both readers and writers of IBMs. In this article we 
propose a standard protocol for describing IBMs (including agent-based models, 
multi-agent simulation, or multi-agent systems; see Discussion). The basic idea of 
the protocol was proposed by [28] and then discussed during an international 
workshop on individual-based modeling held in Bergen, Norway, in the spring of 
2004. Most participants of that workshop are among the authors of this article, 
leading to 28 authors from seven different countries. The work of the authors cov- 
ers a wide range of fields within ecology (e.g., marine, terrestrial, plant, animal, 
behavior, population, forest, theory, conservation, etc.), and the authors have alto- 
gether been involved in the writing of more than 200 IBM-based papers. We 
agreed to test and refine the standard protocol proposed by [28] by applying it to 
our own models: every author, or team of co-authors, rewrote one of their existing 
model descriptions using the new standard protocol. The set of 19 models used in 
this test differs widely in scope, structure, complexity, and implementation details. 
As a result of the test applications, the protocol was slightly revised. 

Here, we first present the standard protocol, which [7] refer to as the PSPC + 3 
protocols. The abbreviation "PSPC" referred to the initials of first four elements of 
the protocol (purpose, structure, process, concepts) and "+3" referred to the re- 
maining three elements. In the revised protocol, however, the names of some ele- 
ments have been changed. We are therefore using a new acronym, "ODD", which 
stands for the three blocks of elements 'Overview', 'Design concepts', and 'De- 
tails' (fig. 1). Then we present an example application of the protocol, and sum- 
marize our experience with test applications in a list of frequently asked questions 
which provides practical hints for using the protocol. Finally we discuss both 
our experience with the test applications and ODD's potentials and limitations 
and how it could contribute to further unification of the formulation and 
implementation of IBMs. 
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Overview 


Purpose 


Stala variables and scale* 


Process overview and scheduling 


Design concepts 


Design concepts 


Details 


Initialization 


Input 


Submodels 



Fig. 1. The seven elements of the ODD protocol, which can be grouped into the three locks: 
Overview, Design concepts, and Details 

6 What Is Fuzzy Logic? 

In this context, Fuzzy Logic (FL) is a problem-solving control system methodol- 
ogy that lends itself to implementation in systems ranging from simple, small, 
embedded micro-controllers to large, networked, multi-channel PC or work- 
station-based data acquisition and control systems. It can be implemented in 
hardware, software, or a combination of both. FL provides a simple way to arrive 
at a definite conclusion based upon vague, ambiguous, imprecise, noisy, or miss- 
ing input information. FL's approach to control problems mimics how a person 
would make decisions, only much faster [9]. 

FL incorporates a simple, rule-based IF X AND Y THEN Z approach to a solv- 
ing control problem rather than attempting to model a system mathematically. The 
FL model is empirically-based, relying on an operator's experience rather than 
their technical understanding of the system. For example, rather than dealing with 
temperature control in terms such as "SP =500F", "T <1000F", or "210C <TEMP 
<220C", terms like "IF (process is too cool) AND (process is getting colder) 
THEN (add heat to the process)" or "IF (process is too hot) AND (process is heat- 
ing rapidly) THEN (cool the process quickly)" are used. These terms are imprecise 
and yet very descriptive of what must actually happen. Consider what you do in 
the shower if the temperature is too cold: you will make the water comfortable 
very quickly with little trouble. FL is capable of mimicking this type of behavior 
but at very high rate [28]. 

7 Proposed Method 



Following is the proposed architecture of the SMA-personality, which is charged 
with working the personality of the robot, ODD protocol is used and the theory of 
the Big five, to define the personality, the SMA-personality, to communicate with 
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diffuse SMA and control access to the values of SMA-personality, are the data ob- 
tained by the AN, and with them assess their development. 

7.1 Design Concepts 

Emergency: Dynamics of the population to leave the conduct of (the robot), but 
the individual's life cycle and behavior are fully represented by empirical rules. 

For this to work, each player creates an SMA (robot), which consists of two 
agents, an agent that is responsible for controlling the engine and the last agent to 
act as coordinating multi-agent system [1,2,3,5]. 

We can describe the proposed method as follows: 

In the operation of the model, we have 2 main blocks, which are responsible for 
knowledge and learning (paradigm of intelligent agents) and the vision and control 
(fuzzy logic). 

These modulus are described below, the first module contains 3 agents, NA 
[Node Agent], TA [Task Agent] SA [System Agent] Agent System (AS), and will 
know every time the operation of the other agents. 

To detect a change in the environment the Agent Node [that is in charge of the 
sensors (ultrasonic, camera and two light sensors)] with the ultrasonic sensor and 
two light sensors, starts its operation, and we start at the state called reactive con- 
trol; this because you have to move and sense all the time until it finds an obstacle, 
this can happen once a photo is taken, which will be sent to the database of images 
(BDI), the image will be applied a pre-processing for recognition in order to know 
whether the object is found, this decision was taken on the angle (angle to take the 
decision to bypass the object), able to escape and move forward, trend data, they 
are caught in the trajectory control process, to observe the speed values. The Task 
Agent (which is responsible for the drive), once all the necessary parameters for 
the performance of the robot agent system (AS) who is the coordinator, and exe- 
cutes instructions in the robot. 

For the development of the Multi- Agent System we used Gaia [29] and UML 
for the analysis and design of the agents. The completion of the Multi-Agent Sys- 
tem (MAS) is based on the FIPA [10] standards for better utilization and greater 
possibility of extension. 

7.2 SMA-Personality Architecture 

Following is the proposed architecture of the SMA-personality, which is charged 
with working the personality of the robot, ODD protocol is used and the theory of 
the Big five, to define the personality, the SMA-personality, to communicate with 
diffuse SMA and control access to the values of SMA-personality, the personality 
will be measured in diffuse, so be empowered to have values closer to reality, is 
data obtained by the AN, and with them assesses its development. 

The values given will be personality in the role of personality characteristic, 
and these values are: 
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At work with big five 
Opening 

• More inventive, less inventive, no inventiveness, 

• More interesting, less funny, anything funny 

• More cautious, less cautious, without caution 

• More conservative, less conservative, nothing conservative 

Conscientiousness 

• Very efficient, less efficient, no efficient 

• More organized, poorly organized, nothing organized 

• Very sloppy, little careless, without caution 

• More conservative, less conservative, nothing conservative 

Extraversion 

• Very energetic, less energetic, anything energetic, 

• Very shy, somewhat shy, not shy 

Kindness 

• Very friendly, unfriendly, unkind, 

• Very compassionate, unkind, unsympathetic 

• Very cooperative, uncooperative, no operation 

Neuroticism 

• Very sensitive, very sensitive, no sensitive 

• Very nervous, a little nervous, not nervous 

• Much security, poor security, no insurance 

• Very anxious, somewhat anxious, nothing anxious 

• Very depressing, slightly depressed, nothing depressing. 

Personality is allocated according to what you want in a home, according to the 
agent can learn to take some of the factors and have a different personality. 

Below is the outline of the architecture of personality, which is composed by 2 
agents, agent personality and consciousness, as well as several modules, such as a 
knowledge base that BC, contains a knowledge base and grow from According the 
data of the other process of the SMA, will also be a module of thought generated, 
which will take care of having the information generated from the consciousness 
of thought, emotion module, will have a set of basic emotions, the module of feel- 
ing that will have a set of basic elements, fig 2. 
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Fig. 2. SMA-Personality 



8 Results 



We show in Figure 3 the plant and the controller that we are used for achieving the 
simulation in Simulink of Mat lab. 
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Fig. 3. Plant for simulation 
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Together with some fuzzy systems which were developed based on work taken 
as a reference [16,17] and that were the basis for the experimentation with differ- 
ent numbers of rules, parameters of membership functions, and types of member- 
ship functions was a good performance of our robot, achieving our goal. 

In figure 4 we show the results presented above for a better understanding, 
which can be seen in the upper part the time it takes for each experiment out of the 
maze at the bottom and the approximate distance traveled to each robot in the ex- 
periment, the red dots are the results of our work, the blue dots are the results of 
the reference points and finally green means yes or no out of the labyrinth in this 
experiment. 
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Fig. 4. Comparison between the work of references [13][14] and the current job 



There are occasions when our results are very similar and therefore only seems 
to be the green dot that tells us whether or not we go out of the labyrinth that is the 
case of experiment number nine in the graph of time in which both works coin- 
cided in time output and both were successful in finding the exit of the maze. 

Now we show other experiments with a fuzzy system with different number of 
fuzzy rules, membership functions and parameters of membership functions. It is 
noteworthy that although this is not a fuzzy system which is optimized for this any 
optimization method can be used. 

The following fuzzy system in Fig. 5 has three inputs, two outputs and consists 
of ten rules to make the inference. 
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Fig. 5. Shows the fuzzy system that works for the simulations 

The first input variable of the fuzzy system is the ultrasonic sensor which has 
three membership functions which are linguistic (close, near, far), as shown in 
Fig. 6. 




input variable " Sense rS> 



Fig. 6. The ultrasonic sensor 

The second variable of the fuzzy system is the light sensor that has two func- 
tions that are membership free and wall, as shown in Fig. 7. 
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Fig. 7. The left light sensor 



The third variable of the fuzzy system is the light sensor that has two functions 
that are membership free and wall, as shown in Fig. 8. 
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Fig. 8. The right light sensor 

In Fig. 9 we show the rules that are used in the fuzzy system, which are 10 
fuzzy rules. 



If (SensorSonico is Lejos) and (SensorLuzl is Pared) and (SensorluzD is Pared) then (MoiorD is Ade!anteR)(Mcrtorl is AdelanteR) (1 ) 

2. If (SensorSonico is Cerca) and (SensorLuzl is Pared) and (SensorluzD is Pared) then (MotorD is AdelanteL)(Motorl is AdelanteL) (1 ) 

3. If (SensorSonico is MuyCerca) and (SensorLuzl is Pared) and (SensorluzD is Pared) then (MotorD is AdelanteL)(Motorl is AdelanteL) (1 ) 
If (SensorSonico is Lejos) and (SensorLuzl is Libre) and (SensorluzD is Pared) then (MotorD is AdelanteR)(Motorl is AtrasL) (1 ) 

5. If (SensorSonico is Cerca) and (SensorLuzl is Pared) and (SensorluzD is Libre)then (MoiorD is AtrasL)(Motorl is AdelanteR) (1) 

6. If (SensorSonico is MuyCerca) and (SensorLuzl is Pared) and (SensorluzD is Libre) then (MotorD is AtrasL)(Motorl is AdelanteR) (1) 

7. If (SensorSonico is Lejos) and (SensorLuzl is Pared) and (SensorluzD is Libre) then (MotorD is AtrasL)(Motorl is AdelanteR) (1 ) 
If (SensorSonico is Cerca) and (SensorLuzl is Libre) and (SensorluzD is Libre) then (MotorD is AcielanteR)(Motorl is Cero) (1 ) 
If (SensorSonico is Lejos) and (SensorluzD is Pared) then (MotorD is AdelanteL)(Motorl is AdelanteL) (1 ) 

Fig. 9. Rules of the fuzzy system (FIS) 

Having modified and improved the fuzzy systems, we proceeded with a slight 
modification to our Multi-Agent System, which consists in adding a new agent 
which has the task of giving a second option in case our robot gets stuck some- 
where in the world, for example corners in which our sensors cannot do some ac- 
tion, then come into action. The new agent called Support Agent enters into action 
as I mentioned in the case that the sensor values do not become active some rule 
of fuzzy systems that we throw the speed of the drive to get out of where we are 
stuck with only works with the task agent (TA) for which our multi-agent system 
would be as shown in Fig. 10. 
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Fig. 10. New scheme of Multi- Agent System 
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Integrating it to our general scheme we would be as shown in Fig 1 1 . 
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Fig. 11. New architecture of the complete system [5] 

Now with these changes we had better control of movements of the robot in the 
search for the exit of the maze, which can be seen in the graphs of the following 
experiments, fig. 12, 13, and 14. 
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9 Conclusions 



The development of intelligent applications is varied, in this particular case, the 
implementation of multi-agent systems is complex because it consists of several 
agents, whit the incorporation of other techniques, making it even more complex. 

One of the main objectives of this investigation is to open a new area of intelli- 
gent agents backed by human psychology, and with the support of fuzzy logic, 
which hitherto has not been the approach intended here, which gives a degree of 
utilization of this paradigm of intelligent agents, but more research is needed to 
defend this innovative idea. 

Given the non-optimized fuzzy systems, we have shown better results than 
were obtained in the work taken as reference [5]. From what we have achieved, 
we can say that a good percentage of the objectives in the area of control and 
learning paths. Future work consists in testing the SMA-personality, adding fuzzy 
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logic to the SMA, resulting in a fuzzy SMA with broader personality, both for 
decision making, and the personality. 
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Abstract. Even though it is known that chaotic time series cannot be accurately 
predicted, there is a need to forecast their behavior in may decision processes. 
Therefore several non-linear prediction strategies have been developed, many of 
them based on soft computing. In this chapter we present a new neural network 
architecutre, called Hybrid and based-on-Wavelet-Reconstructions Network 
(HWRN), which is able to perform recursive long-term prediction over highly dy- 
namic and chaotic time series. HWRN is based on recurrent neural networks em- 
bedded in a two-layer neural structure, using as a learning aid, signals generated 
by wavelets coefficients obtained from the training time series. In the results re- 
ported here, HWRN was able to predict better than a feed-forward neural network 
and that a fully-connected, recurrent neural network with similar number of nodes. 
Using the benchmark known as NN5, which contains chaotic time series, HWRN 
obtained in average a SMAPE = 26% compared to a SMAPE = 61% obtained by a 
fully-connected recurrent neural network and a SMAPE = 49% obtained by a feed 
forward network. 

1 Introduction 

The use of long-term prediction as a tool for complex decision processes involving 
dynamical systems has been of high interest for researchers in the last years. Some 
current prediction strategies approximate a model of the unknown dynamical sys- 
tem analyzing information contained in a solely time-series, which is supposed to 
described the system's behavior. A time series may be defined as an ordered se- 
quence of values observed from a measurable phenomena: x l ,x 2 ...x n ; such observa- 
tions are sensed at uniform time intervals and may be represented as integer or real 
numbers [25]. Once defined, an approximated model may be used to predict the 
trend of the system behavior or to predict as much specific values of the time se- 
ries as desired. As usual, such model will be just as good as the information used 

O. Castillo et al. (Eds.): Soft Computing for Intell. Control and Mob. Robot., SCI 318, pp. 253 1-268. 1 
springerlink.com © Springer- Verlag Berlin Heidelberg 2010 



254 P. Gomez-Gil, A. Garcia-Pedrero, and J.M. Ramirez-Cortes 

to construct it and as the capability of the modeler to represent important informa- 
tion embedded in the time series being analyzed. 

Time series prediction consists on estimating future values of a time series 
x t+l ,x t+2 .- using past time series values x 1 ,x 2 ---x r . One-step or short-term predic- 
tion occurs when several past values are used to predict the next unknown value of 
the time series. If no exogenous variables are considered, one-step prediction may 
be defined as [19]: 



K t+\ -<f\X t _i,A t _2— *t-p> 



(1) 



where is a approximation function used to predict. Similarly, long term predic- 
tion may be defined as: 

x t+h— x t+2' X t+l =< P( x t-l' X t-2— x t-p) (2) 

where h denotes the prediction time horizon, that is, the number of future values to 
be obtained by the predictor at once. Long term prediction may also be achieved 
by recursive prediction, which consists of recursively using equation (1) by feed- 
ing back past predicted values to the predictor to calculate the new ones. 

The construction of models able to predict highly nonlinear or chaotic time se- 
ries is of particular interest in this research. A chaotic time series is non-stationary, 
extremely sensitive to initial conditions of the system and contains at least one 
positive Lyapunov Exponent [15]. It is claimed that chaotic time series may only 
be short-term predicted [20]. Even though, in some cases it is possible to ap- 
proximate a dynamical model with similar characteristics to that found in the non- 
linear time series and to use it for long-term prediction. There are many techniques 
used to build predictors; they may be linear or non-linear, statistical or based on 
computational or artificial intelligence. For example, ARMA, ARIMA and Kal- 
man filters are linear methods [21]; k-nearest neighbors, genetic algorithms and 
artificial neural networks are examples of non-linear methods. Only non-linear 
methods are useful to forecast non-linear time series. 

The use of fully-connected, recurrent neural networks for long-term prediction 
of highly-dynamical or chaotic time series has been deeply studied [23]. In spite of 
the powerful capabilities of these models to represent dynamical systems, their 
practical use is still limited, due to constraints found in defining an optimal num- 
ber of hidden nodes for the network and the long time required to train such net- 
works. As a way to tackle these problems, complex architectures with a reduced 
number of connections, better learning abilities and special training strategies 
have been developed [13]; examples of such works are found at [2,3, 
4,10,11,13,15,25,26, 29, 30,31] among others. From the vast number of strategies 
used to improve the long term prediction ability of neural networks, Wavelet The- 
ory is used either to modify neuron architectures (for example [2,6,12,31,33]) or 
as a pre-processing aid applied to training data (for example [11,26,28,29]). When 
wavelet theory is used to modify the neuron architecture, normally it is done using 
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a wavelet function as the activation function [33]. Other works combine wavelet 
decomposition (as a filtering step) and neural networks to provide an acceptable 
prediction value [1 1,29]. 

In this chapter we present a novel neural prediction system called HWRN (Hy- 
brid and based-on-Wavelet-Reconstructions Network). HWRN is based on recur- 
rent neural networks, inspired at the Hybrid complex neural network [15] and with 
a particular kind of architecture and training scheme supported by wavelet decom- 
position. In the experiments reported here, HWRN was able to learn and predict as 
far as 56 points of two highly-dynamical time series, obtaining better performance 
than a fully-connected recurrent neural network and a three-layer, feed-forward 
neural network with similar number of nodes than the HWRN. This chapter is or- 
ganized as follows: section two describes the main characteristics, general struc- 
ture and training scheme of the model. In the same section some details are given 
related to reconstruction of some signals that are used for supporting training, 
which is based on discrete wavelet transforms. Criteria used to evaluate the per- 
formance of the system are presented at section three. Section four describes the 
experiments performed and their results; it also includes a description of the time 
series used to evaluate the model. Last section presents some conclusions and on- 
going work. 



2 Model Description 

HWRN is built using several small, fully-connected, recurrent neural networks 
(SRNN) attached to a recurrent layer and an output layer. Figure 1 shows the gen- 
eral architecture of HWRN. The SRNN are used to learn signals obtained from the 
training time series that contain different frequency-time information. Outputs of 
the SRNN are fed to a recurrent layer, which is able to memorize time information 
of the dynamical system. The last layer acts as a function approximator builder. 
The output of each node i at HWRN and SRNN is defined as: 

d ^ = -y i + a(x i ) + L (3) 

dt 

where: 



X; 






represents the inputs to the i-th neuron coming from other 
m neurons, 
/, is an external input to i-th neuron, 

W jt is the weight connecting neuron i to neuron j, 

a( x ) is the node's transfer function; it is a sigmoid for all 

layers except output layer, for which transfer function is linear. 
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In order to be solved, equation 3 may be approximated as [27]: 

y t (t + At) = (1 - At)y t (t) + Ata( Xi (t)) + AtL t (t) (5) 

for a small At , where: 

in 

7=1 

For the results reported here, initial conditions of each node y i (t = 0) , are set as 
small random values. Indeed, there are no external inputs to nodes, that is 
/,-(/) = for all i, all t. 

Training of a HWRN predictor contains three main phases: 

1. Pre-processing of the training time series and generation of reconstructed 
signals, 

2. Training of the SRNN, 

3. Training of the HWRN. 

After being trained, HWRN receives as input k past values of a scaled time series, 
then recurrent prediction is applied to obtain as many futures values as required. 
Each training phase is described next. 

2.1 Phase 1: Preprocessing 

HWRN requires a time series with enough information of the dynamical behavior 
in order to be trained. Such time series may contain integer or real values and the 
magnitude of each element must be scaled to the interval [0,1], This is required in 
order to use sigmoid transfer functions for the nodes in the network. To achieve 
this, the time series may be normalized or linearly scaled; in this research a linear 
scale transformation was applied, as recommended for financial time series by [7]. 
The linear transformation is defined as: 

x, - min(x) 

z, =lb + ' — — (ub - lb) (7) 

max(x) - min(x) 

where: 

ub is the desired upper bound; in this case ub = 1, 
lb is the desired lower bound; in this case lb = 0, 
max(x) is the maximum value found at the time series, 
min(x) is the minimum value found at the time series. 
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Fig. 1. A Hybrid and based-on- Wavelet-Reconstructions Network HWRN (adapted from [12]) 

If the original times series has missing values, they are approximated as the 
mean of their two nearest neighbors. No further processing is applied. 

An important challenge forecasting nonlinear and chaotic time series is the 
complexity found to represent its non-stationary characteristics. To tackle this, the 
HWRN learns frequency information related to different times using different 
components. It is known that wavelet analysis has been used to represent local 
frequency information in a signal. Such analysis calculates the correlation among 
a signal and a function y/(.), called wavelet function. Similarity among both 
functions is calculated for different time intervals, getting a two dimensional rep- 
resentation: time and frequency [1]. In this work, a multi-scale decomposition of 
the training signal is performed using the sub-band coding algorithm of the Dis- 
crete Wavelet Transform [22]. This algorithm uses a filter bank to analyze a dis- 
crete signal x(t). This bank is made of low-pass L(z) and high-pass H(z) filters, 
separating frequency content of the input signal in spectral bands of equal width. 
Figure 2 shows a one-level filter bank. After performing a down-sampling with a 
factor of two, signals cA(t) and cD(t) are obtained. These signals are known as ap- 
proximation and detail coefficients, respectively. This process may be executed it- 
eratively forming a wavelet decomposition tree up to any desired resolution level. 
A three-level decomposition wavelet tree, used for the experiments presented in 
this paper, is shown in Figure 3. The original signal x(t) may be reconstructed 
back using the Inverse Discrete Wavelet Transform (iDWT), adding up the out- 
puts of synthesis filters. Similarly it is possible to reconstruct not only the original 
signal, but also approximation signals that contain low-frequency information of 
the original signal and therefore more information about long-term behavior. In 
the same way, detail signals can be reconstructed; they contain information about 
short-term changes in the original signal. Using the decomposition wavelet tree at 
figure 3, four different signals may be reconstructed (one approximation and three 
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detail signals) using the coefficients shown at the leaves of such tree. For the rest 
of this chapter, these signals are referred as "reconstructed signals." 

For example, figure 4(a) shows a chaotic time series called NN5-101 (see sec- 
tion 4); figure 4(b) shows its most general approximation obtained using coeffi- 
cients cA 3 (see figure 3); figure 4(c) shows the most general detail signal obtained 
using coefficients cD 3 ; figure 4(d) shows detail signal at level 2 obtained using 
coefficients cD 2 ; figure 4(e) shows detail signal at maximum level obtained using 
coefficients cDj. 

During the predictor training, a set of these reconstructed signals is selected and 
independently learned by a set of SRNN. In order to figure out which recon- 
structed signals contain the most important information, all possible combinations 
of reconstructed signals are created; next, signals in each combination are added 
up and the result is compared with the original signal using Mean Square Error 
(see equation 8). The reconstructed signals in the combination with the smallest 
MSE are selected to be learnt by the SRNN. 
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Fig. 4. (a) Original signal NN5-101 (data taken from [7]) 
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Fig. 4. (b) Most general approximation signal obtained from NN5-101 
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Fig. 4. (c) Most general detail signal obtained from NN5-101 
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Fig. 4. (d) Detail signal at level 2 obtained from NN5-101 
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Fig. 4. (e) Detail signal at maximum level obtained from NN5-101 

2.2 Phase 2: Training the SRNN 

SRNN are trained to predict one point in each selected reconstructed signal; they 
receive as input k values of the corresponding reconstructed signal and predict the 
next one. Once trained, the SRNN require only the first k values of the recon- 
structed signal; the rest values are generated using recursive prediction as long as 
the predictor works. These k values are stored as free parameters of the system, to 
use them when prediction of the time series is taking place. 

Training of all SRNN is performed using the algorithm "Real-time real- 
learning based on extended Kalman filter (RTRL-EKF)" [16]. This algorithm con- 
tains 2 parts: gradient estimation and weights adjustment. The first part is done us- 
ing the Real-Time, Real-Learning Algorithm proposed by Williams and Zipser 
[32]; second part is done using an extended Kalman Filter. RTRL-EKF has a 
complexity of 0(n 4 ), where n is the number of neurons in the neural network [12]. 

2.3 Phase 3: Training the HWRN 



After training all SRNN, their weights are imbedded in the architecture of the 
HWRN (see figure 1) which also contains a hidden layer with recurrent connec- 
tions and an output layer with feed-forward connections. The complete architec- 
ture is trained to predict one point of the original signal, keeping fixed the weights 
of sub-networks SRNN. As in the case of SRNN, training is performed using 
"Real-time real-learning based on extended Kalman filter (RTRLEKF) algorithm" 
[16]. 
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3 Metrics for Performance Evaluation 

The prediction ability of the proposed architecture and comparative models was 
measured using three metrics: Mean Square Error (MSE), Symmetrical-Mean Ab- 
solute Percentage Error (SMAPE) and Mean Absolute Scaled Error (MASE). Next 
each metric is explained. 

"Mean Square Error" is defined as: 

MSE = -^{x t -x t f 

n f=i (8) 

The "Symmetrical-Mean Absolute Percentage Error" is scale-independent; there- 
fore it is frequently used to compare performances when different time series are 
involved [17]. This is the official metric used by the "NN5 forecasting competi- 
tion for artificial neural networks & computational Intelligence" [8]. SMAPE is 
defined as: 



SMAPE = -Y 

n ,_, 



f l „ t, l s \ 






<X+jg/2 



(100%) 

(9) 



It is important to point out that SMAPE cannot be applied over time series with 
negative values. 

Other popular metric is the "Mean Absolute Scaled Error," defined as: 



1 " 
MASE = -^ 



n 



1 ^-in 



=2 , x «' x iA 



(10) 
where x, is the original time series and x t is the predicted time series. 

4 Experiments and Results 

The proposed architecture and training scheme were tested using two benchmark 
time series; they are: 

a) The time series generated by Matlab function sumsin(), available at version 
7.4 and commonly used in Matlab demos [24]. It is defined as: 

s{t) = sin(3f ) + sin(0.3f) + sin(0.03?) (11) 

Figure 5 shows an example of 735 points of sumsin() time series. 
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b) Eleven of the time series found in the database of the "NN5 Forecasting 
Competition for Artificial Neural Networks and Computational Intelligence" [8]. 
These time-series correspond to cash drawbacks occurred daily in teller machines 
at England from 1996 to 1998; these series may be stationary, have local tenden- 
cies or contain zeroes or missing values. Figure 4 (a) shows the first time-series of 
such database, identified as "NN5-101". The eleven time-series used here corre- 
spond to what is called the "reduced set" in such competition. In order to deter- 
mine if these series were chaotic, the maximum Lyapunov Exponent (LE) of each 
one was calculated using the method proposed by Sano and Sawada [18]. Table 1 
shows the maximum LE of each time series; notice that all are positive, an indica- 
tion of chaos. 




Fig. 5. 735 points of the time series sumsin() 



Table 1. Maximum LE of reduced set series NN5 [12] 



Series ID 


Maximum LE 


NN5-101 


0.0267 


NN5-102 


0.6007 


NN5-103 


0.0378 


NN5-104 


0.0565 


NN5-105 


0.0486 


NN5-106 


0.0612 


NN5-107 


0.0678 


NN5-108 


0.0384 


NN5-109 


0.8405 


NN5-110 


0.0621 


NN5-111 


0.0220 
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The HWRN contains 3 SRNN; the number of nodes at each SRNN was from 6 
to 10, determined experimentally depending upon the reconstructed signal being 
learnt; the hidden layer has 10 nodes. The performance of HWRN was compared 
with a three layer, feed-forward neural network (5-26-1) and a fully-connected 
recurrent neural network with 5 input nodes, 26 hidden nodes and one output 
node. These architectures have a similar number of nodes as the HWRN. All ar- 
chitectures receive as input 5 values (A: = 5) of the time series and predict next 
value. Recurrent prediction is used to generate 56 futures values, following rules 
of the "NN5 forecasting competition for artificial neural networks & computa- 
tional Intelligence" [8]. The architecture was implemented using Matlab V7.4, 
C++, and public libraries for the training algorithm available at [5]. For both 
cases, four reconstructed signals were generated using DWT with wavelet func- 
tion Daubechies 'dbW available at Matlab. Three of the reconstructed signals 
were selected using the strategy described at section 2. 1 . 

Twelve experiments were executed for each time series and each neural model. 
For each experiment, a different random initial set of weights was used. All train- 
ings were made of 300 epochs. The first 635 values of each series were employed 
to train all models and the next 56 values were used as a testing set to compare the 
performance of the proposed architecture with respect to the other two models. 
The last 56 values of the series were used as a validation set in order to compare 
the performance of this architecture with respect to the competition results pub- 
lished by [9]. 

Table 2 shows the results obtained using recursive prediction of 56 values 
(validation set) by the 12 experiments over series sumsim(); the metric MAPE is 
not shown because it is not valid for negative values, as is with sumsin(). Figure 6 
plots 56 predicted values (validation set) of series NN5-109, which was the series 
at NN5 dataset that obtained the best prediction results, with a SMAPE = 20.6%. 
Figure 7 plots 56 predicted values (validation set) of series NN5-107, which was 
the worst case obtained with series NN5, with a SMAPE = 40.5%. 

Table 3 summarizes the average results obtained for the two cases, all experi- 
ments, all architectures predicting the validation set. For the three metrics in the 
two tested cases, HWRN got, in average, better results than the feed-forward and 
the fully-connected recurrent architectures. HWRN got a average SMAPE of 54% 
for the sumsinn() time series and 27% for the NN5 time series. It is important to 
point out that, with respect to contest results published by [9] using NN5 reduced 
test, HWRN could be located between the 16th and 17th place in the category of 
"neural networks and computational intelligence methods." 

Notice at table 3 the high Standard Deviation found in the performance meas- 
ured by MASE for the three architectures. This may be due to the facts that these 
series are chaotic (see table 1), and that the ability of the learning algorithm 
RTRLEKF to find the best solution space depend, among other factors, upon the 
initial set of weights randomly generated. However, it may be noticed that HWRN 
got the smallest Standard Deviation for these cases. 
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Table 2. Twelve experiments predicting validation set over series sumsinf). For a definition 
of MSA and MASE see equations (8) and (10) 



Experiment 


Feed-forward 


Recurrent 


HWRN 


Number 


network 


Network 








MSE 


MASE 


MSE 


MASE 


MSE 


MASE 


1 


0.112 


60.827 


0.236 


100.004 


0.110 


73.184 


2 


0.089 


49.823 


0.083 


56.638 


0.092 


66.314 


3 


0.070 


47.931 


0.036 


40.038 


0.370 


41.687 


4 


0.099 


58.184 


0.191 


86.501 


0.034 


41.807 


5 


0.061 


48.613 


0.023 


34.78 


0.029 


37.863 


6 


0.165 


80.531 


0.090 


53.566 


0.040 


43.689 


7 


0.096 


59.478 


0.003 


12.399 


0.132 


78.450 


8 


0.049 


49.816 


0.521 


107.336 


0.052 


49.518 


9 


0.104 


74.631 


0.146 


78.909 


0.054 


49.677 


10 


0.063 


55.017 


0.064 


53.904 


0.116 


67.361 


11 


0.063 


50.018 


0.087 


73.610 


0.099 


68.531 


12 


0.160 


84.994 


0.086 


61.378 


0.021 


32.832 


Mean 


0.094 


59.989 


0.131 


63.255 


0.068 


54.243 


St. deviation 


0.038 


13.044 


0.140 


27.553 


0.039 


15.551 
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Fig. 7. Worst prediction case using NN5, SMAPE = 40.5%, series NN5-109 

Table 3. Prediction errors obtained by the proposed architecture and two other architectures 
using 56 values ahead 



Time Series 


Metric 


Neural Architecture 






Feed-forward 


Recurrent 


HWRN 


sumsin() 


MSE 


0.09 ± 0.04 


0.13 ±0.14 


0.07 ± 0.04 


MASE 


59.99 ± 13.04 


63.25 ±27.55 


54.24 ± 15.55 


Eleven exam- 
ples of NN5 
time series 


MSE 


250.12 ±226.05 


198.69 ±131.12 


34.05 ± 20.12 


SMAPE 


49.28% ± 12.36 


60.75% ±13.05 


27.22% ±8.27 


MASE 


517.50 ± 
1,079.68 


546.31 ± 
1,218.95 


194.99 ± 

387.22 



5 Conclusions 

We presented a novel neural network predictor, called HWRN, based on a combina- 
tion of small, fully-connected recurrent sub-networks, called SRNN, that are embed- 
ded in a composite neural system. This system is able to generate as many future val- 
ues as desired using recursive prediction. HWRN was able to predict up to 56 points 
ahead of several non-linear time series, as shown by experiments done using the time 
series generated by Matlab's function sumsin() and the time series found at the reduced 
set of the "NN5 Forecasting Competition for Artificial Neural Networks and Compu- 
tational Intelligence" [8]. The SRNN's are trained to reproduce selected reconstructed 
signals that represent different frequencies at different times of the original one. The 
reconstructed signals are obtained using the Discrete Wavelet Transform and the In- 
verse Discrete Wavelet Transform [1]. In average, the HWRN obtained a Symmetri- 
cal-Mean Absolute Percentage Error (SMAPE) of 27% when predicting in a recursive 
way 56 points ahead of 1 1 chaotic NN5 time series. This performance was better than 
the obtained with a fully-connected recurrent neural network (SMAPE= 61%) and a 
feed-forward network (SMAPE = 49%), both with similar number of nodes and 
weights. The main drawback of this system is the time required to train it. Currently 
our research group is looking for ways to train this system faster and for a efficient 
method to select the reconstructed signals generated by the iDWT. 
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Abstract. A novel homogeneous integration strategy of an interval type-2 fuzzy 
inference system (IT2FIS) with Takagi-Sugeno-Kang reasoning (TSK IT2FIS) is 
presented. This TSK IT2FIS is represented as an adaptive neural network (NN) 
with hybrid learning (IT2FNN:BP+RLS) in order to automatically generate an 
interval type-2 fuzzy logic system (TSK IT2FLS). Consequent parameters are 
updated with recursive least-square (RLS) algorithm; antecedent parameters with 
back-propagation (BP) algorithm. Mackey-Glass chaotic time series forecasting 
results are presented (x=17, 30, 100) with different signal noise ratio (SNR). 
Soundness for uncertainty, adaptability and learning and generalization 
capabilities is shown using 10-fold Cross Validation, Akaike Information Criteria 
(AIC) and F-Test. 

1 Introduction 

System's modeling is considered one of the most important problems in science. 
Models are abstractions of reality that can be applied to improve our 
understanding of real world phenomena. Finding functional relations between 
variables that intervene in phenomena is a problem that arises in several areas like 
economy, engineering, biology, sociology and medicine. Approximation of 
unknown functions from experimental data, e.g. time series prediction, is 
particularly important. In this case a sequence of time sample sets is used to 
predict the system's behavior in short and long-term. Several types of uncertainty 
are originated by improper data, bad modeling or wrong interpretations given to 
models. Also, the nature of phenomena might generate fluctuating data or 
variables, given thus uncertain data. Human mistakes and bad equipment 
calibration can generate incorrect data [1, 2,6]. Imprecision is the cause of 
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language granularity. Vagueness is considered a new imperfect information 
cathegory. Inconsistence is originated by data redundancy that generate conflict or 
contradictions. Uncertainty levels vary depending on context. This uncertainty can 
be caused by noise in data [3,4,47,48]. Type-1 (T1FLS) and interval type-2 
(IT2FLS) fuzzy logic systems, combined with techniques like neural networks 
have shown their ability to solve different kinds of problems like control [10-12, 
14, 15, 16, 27, 32, 42, 44], prediction [12, 17, 20], signal and images processing 
[12, 13, 18, 28, 29], industrial processes [21, 22], etc. In many cases, success was 
achieved due to adding human expert knowledge. There is consensus between 
researchers that more intelligent systems can be developed by hybrid soft 
computing methodologies [41,45]. Type-1 Fuzzy Neural Network (T1FNN) [9, 
40, 41] and Interval Type-2 Fuzzy Neural Network (IT2FNN) [22-26]; type-1 [31, 
33, 35, 36, 38] and type-2 [32] fuzzy evolutionary systems are typical hybrid 
systems in softcomputing. These systems combine T1FLS generalized reasoning 
methods [30, 41, 42, 44, 45] and IT2FLS [19-21] with neural networks learning 
capabilities [41, 43, 46, 49] and evolutionary algorithms [34, 37, 39, 43], 
respectively. In this paper, four main architecures (IT2FNN-0, IT2FNN-1, 
IT2FNN-2 and IT2FNN-3) for integrating a first order TSK IT2FIS, with real 
consequents (A2C0) and interval consequents (A2C1), are proposed. Integration 
strategies for process elements of TSK IT2FIS are analised for each architecture 
(fuzzification, knowledge base, type reduction and defuzzification). 

2 Interval Type-2 Fuzzy Neural Network (IT2FNN) 

An IT2FNN combines an TSK IT2FIS and an adaptive neural network in order to 
take advantage of characteristics of each model. Adaptive networks [40,41] 
provide a theoretical frame that unifies almost all kinds of neural networks with 
learning capabilities, and their fundamental properties are a key element for 
understanding other paradigms. 

An IT2FNN is characterized by a directed graph G = (V, E) , where fan-in zero 
input nodes and fan-out zero output nodes. Each node /IE V represents a 
processing unit with an associated static function f , and each edge ee £ 
indicates causal relation between connected nodes. Node set V is divided into two 
separate subsets, V = (A'UN). Nodes n G A are called adaptive and their 
outputs depend not only on their inputs, but also on internal modifiable parameters 
{^",^2 5"-} internal to «£ A. On the other hand, nodes HE N whose 
functions depend only on inputs are called non adaptive. In general, when 
representing IT2FNN graphically, rectangles are used to represent adaptive nodes 
and circles to represent non-adaptive nodes. Output values of pair nodes (green 
color) and odd nodes (blue color) represent uncertainty intervals (Fig. 1-6). In this 
kind of interval type-2 neurofuzzy adaptive networks, nodes represent processing 
units called neurons, which can be classified into crisp and fuzzy neurons. In 
IT2FNN, type-1 fuzzy neuron model (T1FN) proposed by Hirota and Pedrycz [7, 
8] is extended into an interval type-1 fuzzy neuron (IT1FN) and interval type-2 
fuzzy neuron (IT2FN). 
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Fig. 1. Interval Type-1 Fuzzy Neuron (IT1FN) 



jl{Ml) $$ 




Fig. 2. Interval Type-2 Fuzzy Neuron (IT2FN) 



IT2FNN-0 architecture has 7 layers. Layer 1 has adaptive nodes for fuzzifying 
inputs; layer 2 has non-adaptive nodes with interval fuzzy values. Layer 3 (rules) has 
non-adaptive nodes for generating firing strength of TSK IT2FIS rules. Layer 4, 
lower and upper values of rules firing strength are normalized. Adaptive nodes in 
layer 5 (consequent) are connected to layer for generating rules consequents. Non- 
adaptive nodes in layer 6 evaluate values from left-right interval. Non-adaptive node 
in layer 7 (defuzzification) evaluates average of interval left-right values. 
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LayerS Layer 6 Layer 1 



Fig. 3. IT2FNN-0 Architecture 

IT2FNN-1 architecture has 5 layers, consists of adaptive nodes with equivalent 
function to lower-upper membership in fuzzification layer (layer 1). Non-adaptive 
nodes in rules layer (layer 2) interconnect with fuzzification layer (layer 1) in 
order to generate TSK IT2FIS rules antecedents. Adaptive nodes in consequent 
layer (layer 3) are connected to input layer (layer 0) to generate rules consequents. 
Non-adaptive nodes in type-reduction layer (layer 4) evaluate left-right values 
with Karnik and Mendel (KM)[19-21] algorithm. Non-adaptive node in 
defuzzification layer (layer 5) average left-right values. 




Fig. 4. IT2FNN-1 Architecture 
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In IT2FNN-2 case, consists on 6 layers, uses IT2FN for fuzzifying inputs 
(layers 1-2). Non-adaptive nodes in rules layer (layer 3) interconnect with lower- 
upper linguistic values layer (layer 2) to generate TSK IT2FIS rules antecedents. 
Non-adaptive nodes in consequents layer (layer 4) are connected with input layer 
(layer 0) to generate rules consequents. Non-adaptive nodes in type-reduction 
layer (layer 5) evaluate left-right values with KM algorithm. Non-adaptive node in 
defuzzification layer (layer 6) averages left-right values. 




Fig. 5. IT2FNN-2 Architecture 



IT2FNN-3 architecture has 8 layers, uses IT2FN for fuzzifying inputs (layers 
1-2). Non-adaptive nodes in rules layer (layer 3) interconnect with lower-upper 
linguistic values layer (layer 2) to generate TSK IT2FIS rules antecedents. 
Adaptive nodes in layer 4 adapt left-right firing strenght, biasing rules lower-upper 
trigger forces with synaptic weights between layers 3-4. Layer 5's non-adaptive 
nodes normalize rules lower-upper firing strenght. Non-adaptive nodes I 
consequent layer (layer 6) interconnect with input layer (layer 0) to generate rules 
consequents. Non-adaptive nodes in type-reduction layer (layer 7) evaluate left- 
right values adding lower-upper product of lower-upper triggering forces 
normalized by rules consequent left-right values. Node in defuzzification layer is 
adaptive and its output y is defined as biased average of left-right values and 
parameter J. Parameter J (0.5 by default) adjusts uncertainty interval defined by 
left-right values [y t ,y r ]. 
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Fig. 6. IT2FNN-3 Architecture 



3 Mackey-Glass Forecasting Chaotic Time Series 



In this chapter, results from simulations using ANFIS, IT2FNN-0, IT2FNN-1, 
IT2FNN2 and IT2FNN3 are presented for forecasting Mackey-Glass chaotic time 
series [50] with 7 = 17, 30, 100 and different signal noise ratio values, SNR(dB) = 
0, 10, 20, 30 as uncertainty source. Levenberg-Marquardt hybrid method is used 
for adapting IT2FNN models parameters. Proposed IT2FNN architectures are 
validated using 10-fold cross-validation [53, 54] considering sum square error 
(SSE) or root mean square error (RMSE) in training or test phase; Akaike 
information criteria (AIC) and F test [51, 52]. Cross-validation procedure 
evaluation is done using Matlab's crossvalind function. Noise is added by 
Matlab's awgn function. Also, AIC and F test are evaluated using statistical 
Matlab's Toolbox. In Mackey-Glass chaotic time series, 1200 data sets where 
generated with initial conditions x(0)=1.2 and x= 17, 30 and 100 using Runge- 
Kutta 4 th order method and several uniform noise added. An input-output vector 
was taken with format: [x(t-18), x(t-12), x(t-6), x(t); x(t+6)]. For identifying 
ANFIS [40, 41] and IT2FNN models, an IT2FNN model with 4 inputs and one 
output is used with Mackey-Glass chaotic time series using: 16 rules, 2 
igaussmtype2 IT2MF for each input, 50 epochs, 500 training data and 500 test 
data with 10-fold cross-validation. Tables 1-9 and figures 7-12 show RMSE (TRN 
and CHK) with 10-fold cross validation and the number of points ( g) out of 
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uncertainty interval Y(x) G [9i(x),y r ( x )] results between IT2FNN models and 
ANFIS. It can be seen that IT2FNN architectures forecast better than Mackey- 
Glass chaotic time series when noise is present. 

Table 1. RMSE (TRN/CHK) and ? values determined in models ANFIS and IT2FNN 
with 10-fold cross-validation for forecasting Mackey-Glass chaotic time series with T=ll 



SNR(dB) 





10 


20 


30 


free 


ANFIS 


TRN 


0.2643 


0.0902 


0.0316 


0.0110 


0.0020 


CHK 


0.2917 


0.0989 


0.0345 


0.0118 


0.0021 


g 


NA 


NA 


NA 


NA 


NA 


IT2FNN-0 


TRN 


0.2357 


0.0776 


0.0289 


0.0088 


0.0017 


CHK 


0.2576 


0.0837 


0.0327 


0.0107 


0.0019 


S 


28 


23 


17 


13 


7 


IT2FNN-1 


TRN 


0.2089 


0.0672 


0.0268 


0.0056 


0.0016 


CHK 


0.2334 


0.0875 


0.0318 


0.0078 


0.0018 


g 


25 


19 


15 


11 


6 


IT2FNN-2 


TRN 


0.1883 


0.0540 


0.0189 


0.0066 


0.0014 


CHK 


0.1947 


0.0592 


0.0207 


0.0071 


0.0015 


g 


24 


18 


14 


10 


4 


IT2FNN-3 


TRN 


0.1558 


0.0427 


0.0134 


0.0041 


0.0011 


CHK 


0.1632 


0.0549 


0.0148 


0.0043 


0.0013 


S 


20 


15 


11 


8 


3 



Mackey-Glass Chaotic Time-Series % = 1 7 



— ANFISTRN 

— IT2FNN-0:TRN 

— IT2FNN-1:TRN 

— IT2FNN-2:TRN 

— IT2FNN-3:TRN 
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Fig. 7. RMSE (TRN) values determined in models ANFIS and IT2FNN with 10-fold cross- 
validation for forecasting Mackey-Glass chaotic time series with r=17 
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Mackey-Glass Chaotic Time-Series x = 1 7 



-ANFIS:CHK 
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Fig. 8. RMSE (CHK) values determined in models ANFIS and IT2FNN with 10-fold cross- 
validation for forecastingMackey-Glass chaotic time series with r=17 



Table 2. Akaike Information Criteria (AIC) of TRN/CHK and b values determined in 
models ANFIS and IT2FNN with 10-fold cross-validation for forecasting Mackey-Glass 
chaotic time series with 7=17 



SNR(dB) 





10 


20 


30 


free 


ANFIS 


TRN 


1968.63 


893.58 


-155.29 


-1210.56 


-2915.30 


CHK 


2067.27 


985.66 


-67.49 


-1140.35 


-2866.51 


S" 


NA 


NA 


NA 


NA 


NA 


IT2FNN-0 


TRN 


1872.11 


761.12 


-226.61 


-1415.70 


-3059.82 


CHK 


1960.96 


836.79 


-103.08 


-1220.21 


-2948.60 


s 


28 


23 


17 


13 


7 


IT2FNN-1 


TRN 


1911.40 


777.22 


-142.05 


-1707.68 


-2960.45 


CHK 


2022.30 


1041.19 


29.02 


-1376.33 


-2842.66 


$" 


25 


19 


15 


11 


6 


IT2FNN-2 


TRN 


1807.59 


558.53 


-491.29 


-1543.38 


-3093.98 


CHK 


1841.01 


650.47 


-400.32 


-1470.36 


-3024.99 


€ 


24 


18 


14 


10 


4 


IT2FNN-3 


TRN 


1618.12 


323.75 


-835.20 


-2019.46 


-3335.14 


CHK 


1664.53 


575.06 


-735.82 


-1971.84 


-3168.09 


s 


20 


15 


11 


8 


3 



Shaded cells in Table 2 show, based on AIC, that architecture IT2FNN-3 
(TRN/CHK) forecasts better Mackey-Glass chaotic series with 7=17 than ANFIS. 
In Table 3 it is shown that there is no significant improvement in architectures 
IT2FNN-1 (CHK) at SNR=10, 20 and noise-free compared with ANFIS. 
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Table 3. Statistic F (TRN/CHK) and g values determined in models ANFIS and IT2FNN 
with 10-fold cross-validation for forecasting Mackey-Glass chaotic time series with z=\l 



SNR(dB) 





10 


20 


30 


free 


ANFIS/IT2FNN-0 

F„ v5 (9,395) = 1.9036 


TRN 


11.2972 


15.4097 


8.5838 


24.6875 


16.8570 


CHK 


12.3887 


17.3879 


4.9648 


9.4877 


9.7261 


s 


28 


23 


17 


13 


7 


ANFIS/IT2FNN-1 

F„„ (89, 3 15) = 1.3070 


TRN 


2.1262 


2.8374 


1.3814 


10.1169 


1.9909 


CHK 


1.9890 


0.9823 


0.6265 


4.5609 


1.2781 


s 


25 


19 


15 


11 


6 


ANFIS/IT2FNN-2 
F MS (89,315) = 1.3070 


TRN 


3.4336 


6.3359 


6.3547 


6.2921 


3.6838 


CHK 


4.4051 


6.3387 


6.2921 


6.2368 


3.3978 


s 


24 


18 


14 


10 


4 


ANFIS/IT2FNN-3 
F MS (89, 3 15) = 1.3070 


TRN 


6.6461 


12.2542 


16.1434 


21.9371 


8.1609 


CHK 


7.7678 


7.9467 


15.6932 


23.1138 


5.6964 


s 


20 


15 


11 


8 


3 



Table 4. RMSE (TRN/CHK) and g values determined in models ANFIS and IT2FNN 
with 10-fold cross-validation for forecasting Mackey-Glass chaotic time series with r=30 



SNR(dB) 





10 


20 


30 


free 


ANFIS 


TRN 


0.3416 


0.1582 


0.0876 


0.0635 


0.0585 


CHK 


0.3800 


0.1714 


0.0943 


0.0672 


0.0619 


s 


NA 


NA 


NA 


NA 


NA 


IT2FNN-0 


TRN 


0.2543 


0.1278 


0.0633 


0.0512 


0.0444 


CHK 


0.2722 


0.1385 


0.0719 


0.0555 


0.0467 


s 


36 


29 


25 


19 


9 


IT2FNN-1 


TRN 


0.2097 


0.1123 


0.0517 


0.0413 


0.0326 


CHK 


0.2605 


0.1203 


0.0611 


0.0429 


0.0359 


g 


33 


27 


22 


17 


8 


IT2FNN-2 


TRN 


0.1719 


0.0812 


0.0415 


0.0301 


0.0297 


CHK 


0.1801 


0.0892 


0.0447 


0.0318 


0.0289 


G 


30 


26 


20 


16 


6 


IT2FNN-3 


TRN 


0.1521 


0.0733 


0.0338 


0.0249 


0.0212 


CHK 


0.1708 


0.0773 


0.0419 


0.0302 


0.0241 


S 


27 


23 


18 


8 


5 



Shaded cells in Table 5 show, based on AIC, that architecture IT2FNN-3 
(TRN/CHK) forecasts better Mackey-Glass chaotic series with T=30 than ANFIS. 
In Table 6 it is shown that there is a significant improvement in all architectures 
IT2FNN (TRN/CHK) model for all SNR values with respect to ANFIS. 
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Mackey-Glass Chaotic Time-Series x - 30 



— ANFISTRN 

— IT2FNN-0:TRN 
■ IT2FNN-1TRN 
— IT2FNN-2:TRN 

— IT2FNN-3TRN 
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Fig. 9. RMSE (TRN) values determined in models ANFIS and IT2FNN with 10-fold cross- 
validation for forecasting Mackey-Glass chaotic time series with t =30 
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Fig. 10. RMSE (CHK) values determined in models ANFIS and IT2FNN with 10-fold 
cross-validation for forecasting Mackey-Glass chaotic time series with t =30 
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Table 5. Akaike Information Criteria (AIC) of TRN/CHK and g values determined in 
models ANFIS and IT2FNN with 10-fold cross-validation for forecasting Mackey-Glass 
chaotic time series with r =30 



SNR(dB) 





10 


20 


30 


free 


ANFIS 


TRN 


2225.19 


1455.41 


864.33 


542.59 


460.58 


CHK 


2331.72 


1535.55 


938.03 


599.22 


517.07 


5" 


NA 


NA 


NA 


NA 


NA 


IT2FNN-0 


TRN 


1948.06 


1260.02 


557.43 


345.29 


202.79 


CHK 


2016.09 


1340.42 


684.83 


425.93 


253.29 


s 


36 


29 


25 


19 


9 


IT2FNN-1 


TRN 


1915.23 


1290.72 


515.01 


290.41 


53.86 


CHK 


2132.15 


1359.54 


682.06 


328.42 


150.29 


G 


33 


27 


22 


17 


8 


IT2FNN-2 


TRN 


1716.46 


966.46 


295.24 


25.93 


39.30 


CHK 


1763.06 


1060.43 


369.52 


29.02 


66.61 


s 


30 


26 


20 


16 


6 


IT2FNN-3 


TRN 


1594.09 


864.11 


90.01 


-215.58 


-376.45 


CHK 


1710.04 


917.24 


308.83 


-22.61 


-248.24 


s 


27 


23 


18 


8 


5 



Table 6. Statistic F (TRN/CHK) and g values determined in models ANFIS and IT2FNN 
with 10-fold cross-validation for forecasting Mackey-Glass chaotic time series with r=30 



SNR(dB) 





10 


20 


30 


free 


ANFIS/IT2FNN-0 

F us (9,395) = 1.9036 


TRN 


35.3061 


23.3632 


40.1645 


23.6202 


32.3015 


CHK 


41.6464 


23.3277 


31.6065 


20.4550 


33.2196 


s 


36 


29 


25 


19 


9 


ANFIS/IT2FNN-1 

F m , (89, 3 15) = 1.3070 


TRN 


5.8527 


3.4845 


6.6219 


4.8276 


7.8578 


CHK 


3.9920 


3.6454 


4.8913 


5.1452 


6.9830 


s 


33 


27 


22 


17 


8 


ANFIS/IT2FNN-2 
F MS (89, 3 15) = 1.3070 


TRN 


10.4374 


9.8952 


12.2307 


12.2126 


10.1922 


CHK 


12.2172 


9.5288 


12.2124 


12.2661 


12.6977 


g 


30 


26 


20 


16 


6 


ANFIS/IT2FNN-3 
F MS (89, 3 15) = 1.3070 


TRN 


143131 


12.9471 


20.2343 


19.4788 


23.4108 


CHK 


13.9798 


13.8620 


14.3880 


13.9852 


19.8096 


s 


27 


23 


18 


8 


5 
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Table 7. RMSE (TRN/CHK) and g values determined in models ANFIS and IT2FNN 
with 10-fold cross-validation for forecasting Mackey-Glass chaotic time series with r=100 



SNR(dB) 





10 


20 


30 


free 


ANFIS 


TRN 


0.4253 


0.1839 


0.1035 


0.0706 


0.0617 


CHK 


0.4677 


0.2035 


0.1227 


0.0758 


0.0669 


£ 


NA 


NA 


NA 


NA 


NA 


IT2FNN-0 


TRN 


0.3017 


0.1445 


0.0828 


0.0563 


0.0415 


CHK 


0.3322 


0.1497 


0.0898 


0.0681 


0.0483 


g 


50 


42 


38 


29 


13 


IT2FNN-1 


TRN 


0.2258 


0.1133 


0.0687 


0.0406 


0.0307 


CHK 


0.2976 


0.1235 


0.0779 


0.0473 


0.0397 


g 


48 


40 


36 


27 


11 


IT2FNN-2 


TRN 


0.1751 


0.0724 


0.0412 


0.0313 


0.0189 


CHK 


0.1899 


0.0783 


0.0570 


0.0398 


0.0296 


s 


46 


38 


34 


24 


8 


IT2FNN-3 


TRN 


0.1311 


0.0527 


0.0258 


0.0179 


0.0155 


CHK 


0.1377 


0.0554 


0.0311 


0.0212 


0.0133 


S 


39 


31 


28 


19 


6 
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Fig. 11. RMSE (TRN) values determined in models ANFIS and IT2FNN with 10-fold 
cross-validation for forecasting Mackey-Glass chaotic time series with r=100 
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Fig. 12. RMSE (CHK) values determined in models ANFIS and IT2FNN with 10-fold 
cross-validation for forecasting Mackey-Glass chaotic time series with r=100 



Table 8. Akaike Information Criteria (AIC) of TRN/CHK and £ values determined in 
models ANFIS and IT2FNN with 10-fold cross-validation for forecasting Mackey-Glass 
chaotic time series with r=100 



SNR(dB) 





10 


20 


30 


free 


ANFIS 


TRN 


2444.34 


1605.94 


1031.12 


648.58 


513.83 


CHK 


2539.38 


1707.21 


1201.29 


719.65 


594.75 


S" 


NA 


NA 


NA 


NA 


NA 


IT2FNN-0 


TRN 


2118.98 


1382.83 


825.98 


440.24 


135.24 


CHK 


2215.29 


1418.18 


907.13 


630.53 


286.98 


c 


50 


42 


38 


29 


13 


IT2FNN-1 


TRN 


1989.20 


1299.59 


799.30 


273.32 


-6.19 


CHK 


2265.30 


1385.79 


924.97 


426.06 


250.90 


g 


48 


40 


36 


27 


11 


IT2FNN-2 


TRN 


1734.91 


851.76 


287.99 


13.17 


-491.29 


CHK 


1816.05 


930.10 


612.60 


253.42 


-42.68 


s 


46 


38 


34 


24 


8 


IT2FNN-3 


TRN 


1445.51 


534.16 


-180.08 


-545.65 


-689.61 


CHK 


1494.63 


584.13 


6.76 


-376.45 


-842.69 


s 


39 


31 


28 


19 


6 
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Table 9. Statistic F (TRN/CHK) and g values determined in models ANFIS and IT2FNN 
with 10-fold cross-validation for forecasting Mackey-Glass chaotic time series with r=100 



SNR(dB) 





10 


20 


30 


free 


ANFIS/IT2FNN-0 

F M5 (9,395) = 1.9036 


TRN 


43.3268 


27.1968 


24.6875 


25.1267 


53.1238 


CHK 


43.1053 


37.2147 


38.0502 


10.4860 


40.3112 


s 


50 


42 


38 


29 


13 


ANFIS/IT2FNN-1 

F 095 (89,315) = 1.3070 


TRN 


9.0170 


5.7851 


4.4939 


7.1630 


10.7567 


CHK 


5.2022 


6.0705 


5.2415 


5.5501 


6.5113 


s 


48 


40 


36 


27 


11 


ANFIS/IT2FNN-2 
F„, (89, 3 15) = 1.3070 


TRN 


17.3411 


19.2960 


18.7967 


14.4677 


34.1803 


CHK 


17.9294 


20.3677 


12.8613 


9.2985 


14.5403 


s 


46 


38 


34 


24 


8 


ANFIS/IT2FNN-3 
F 095 (89,315) = 1.3070 


TRN 


33.7089 


39.5592 


53.4196 


51.5191 


52.5432 


CHK 


37.2914 


44.2169 


51.5527 


41.7074 


86.0115 


g 


39 


31 


28 


19 


6 



Shaded cells in Table 8 show, based on AIC, that architecture IT2FNN-3 
(TRN/CHK) forecasts better Mackey-Glass chaotic series with 7=100 than 
ANFIS. In Table 9 it is shown that there is significant improvement in 
architectures IT2FNN-1 (CHK) model for all SNR values with respect to ANFIS. 

4 Conclusions 

This paper models interval type-2 fuzzy neural networks (IT2FNN) architectures 
for forecasting Mackey-Glass chaotic time series. Combining neural networks and 
interval type-2 fuzzy logic systems improves learning, uncertainty handle and 
discovery, which are desirable characteristics for intelligent processing of 
imperfect information. Results show higher efficiency due to better representation 
capability and scalability. Mackey-Glass results analyzed with cross-validation, 
AIC and F tests, show that IT2FNN is a generalization of autoregressive models in 
times series case. 

Cross validation, AIC and F test contrast analysis with statistical models show 
that IT2FNN models have better results under high uncertainty or same results 
under low uncertainty as ANFIS models. 
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Abstract. This paper presents the proposed architecture for Ensembles of ANFIS 
(adaptive Network based fuzzy inference system), with emphasis on its applica- 
tion to prediction of chaotic time series (like the Mackey-Glass), where the goal is 
to minimize the prediction error. The methods used for the integration of the En- 
sembles of ANFIS are: Integrator by average and the integrator of weighted aver- 
age. The performance obtained with the Ensemble architecture overcomes several 
standard statistical approaches and neural network models reported in the litera- 
ture by various researchers. In the experiments we changed the type of member- 
ship functions and the desired error, thereby increasing the complexity of the 
training. 

1 Introduction 

System modeling based on conventional mathematical tools (e.g., differential 
equations) is not well suited for dealing with ill-defined and uncertain systems. By 
contrast, a fuzzy inference system employing fuzzy if-then rules can model 
the qualitative aspects of human knowledge and reasoning processes without 
employing precise quantitative analyses. 

This fuzzy modeling or fuzzy identification, first explored systematically by 
Takagi and Sugeno [12], has found numerous practical applications in control, 
prediction and inference. However, there are some basic aspects of this approach, 
which are in need of better understanding. More specifically: 

1) No standard methods exist for transforming human knowledge or experience 
into the fuzzy rule base and database of a fuzzy inference system. 

2) There is a need for effective methods for tuning the membership functions 
(MF's) so as to minimize the output error measure or maximize a performance index. 
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In this perspective, the aim of this paper is to suggest a new architecture for 
Ensembles of Adaptive-Network-based Fuzzy Inference System, or simply 
Ensembles of ANFIS, which can serve as a basis for constructing a set of fuzzy 
if-then rules with appropriate membership functions to generate the estipulated in- 
put-output pairs. The architecture of adaptive fuzzy inference systems proposed by 
[6] uses the gradient descent backpropagation [5] and least squares to achieve the 
learning ability of ANFIS. This paper reports the results of the simulations, where 
the Ensembles of ANFIS was used to predict chaotic time series of Mackey-Glass 
[8], where the results for each ANFIS were evaluated by the criteria of the root 
mean square error (RMSE). For the integration of the results of each Ensembles of 
ANFIS two different integration methods were used: integrator by average, and in- 
tegration of weighted average. The selection of this time series for the simulation 
was simply because it is a benchmark problem that has been widely quoted in the 
literature by different researchers, which allows to compare results with other 
approaches such as neural networks and linear regression. 

In the next section we describe the basics of ANFIS and the ANFIS Ensemble 
learning, like the architecture proposed in this paper. Section 3 presents the cha- 
otic time series, as well as the adjustment parameters of the ANFIS Ensemble. 
Section 4 presents the methods of integration of the Ensembles of ANFIS. 
Section 5 presents the simulation and the results obtained in this work. Section 6 
presents conclusions and future work. 

2 ANFIS and Ensemble Learning 

This section presents the basic concepts of ANFIS, to give us an idea of how the 
operation of ANFIS really is. In the second part we consider the learning, for un- 
derstanding how they apply to our proposal. In the last part we describe the 
architecture or structure we are proposing for the Ensembles of ANFIS. 

2.1 Basics Aspects of ANFIS 

This section presents the basic architecture of ANFIS and the hybrid learning that 
it uses. 

For simplicity, consider a fuzzy inference system with two rules of 
Takagi-Sugeno type [13]: 

Rule 1 : If x is Ai and y is Bj, then/; = pi + piy + rj 
Rule 2: If x is A 2 and y is B 2 , then / 2 = p 2 + p 2 y + r 2 
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Fig. 1. (a) and (b) illustrates the reasoning mechanism and the corresponding 
ANFIS architecture, respectively. The functions of the nodes in the same layer of 
ANFIS are of the same family of functions, as described below. (In subsequent, 
O 1 ; denotes the (th output node of the layer j. 



ZLES 
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"' f,=tox*iljy+r, 



L^ /j'ftK+fl^+f; 
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Saysr 2 layer 3 -l 




Fig. 1. (a) fuzzy model; (b) equivalent ANFIS architecture 

Layer 1: Each node in this layer corresponds to a linguistic label and the output 
node is equal to the value of membership in this linguistic label. The parameters of 
a node can change the shape of the membership function used to characterize the 
linguistic label. For example, the function of the rth node is 



Of=pA i (x) 



1 + [(Li vj^] 



(1) 



where k is the input node i; A t is linguistic label small, large, etc. Associated with 
this node, and {a,, £>,, c,} is the set of parameters. Parameters in this layer are 
called premise parameters. 



Layer 2: each node in this layer calculates the firing power of each rule: 
Of =a> t = itA t 00 X ttB t {y).t = 1, 2. 



(2) 



Layer 3: the fth node of this layer calculates the ratio of the firing strength of the 
ith rule of the sum of all the firing strength: 



<1>: = 



to x + w 2 



-,1 = 1,2 



(3) 
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Layer 4: node i in this layer has the following node function 

Of = ioJi = ft ( Pi X + qfl + r ( ) (4) 

Layer 5: the single node in this layer computes the overall output as the sum of all 
input signals: 

Si *>ifi (5) 



overall output = 0\ = £ a^/j = 



In this way an adaptive network has been built (Fig. 1. (b), which is functionally 
equivalent to inference systems (Fig. 1. (a)), so it is called ANFIS, referring to 
fuzzy inference systems based on adaptive networks. The basic learning rule of 
ANFIS is the gradient descent backpropagation [15], which calculates the error 
rates (defined as the derivative of the squared error for each output node) recur- 
sively from the output to the input nodes. This is the same learning rule used in 
neural networks [11]. In the basic architecture of Fig. 1. it may be noted that given 
the values of premise parameters, the total output / can be expressed as linear 
combinations of the parameters accordingly: 

= (ti 2 x)p 2 + l<*> 2 y)q 2 + (^z)*z (6) 

As a result we have a hybrid learning algorithm [7], which combines the gradient 
descent and least-squares estimation. More specifically, the step forward from the 
hybrid learning algorithm, functional signals (output nodes) are in progress to- 
wards layer 4 and the parameters of consequence are identified by least squares. In 
the step back and premise parameters are updated by gradient descent. 

2.2 Ensemble Learning 

Ensemble is a learning paradigm, where multiple component learners are trained 
for a same task, and the predictions of the component learners are combined for 
dealing with future instances [14]. Since an ensemble is often more 
accurate than its component learners, such a paradigm has become a hot topic 
in recent years and has already been successfully applied to optical character 
recognition, face recognition, scientific image analysis, medical diagnosis, etc 
[16]. 

In this paper, the learning of the total output of each Ensemble was calculated 
with integrators of average and weighted average, just as the output of each 
ANFIS is calculated with the root mean squared error RMSE. 
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2.3 Structure of the Ensemble of ANFIS 

In this paper, we propose the structure illustrated in Fig. 2. 



ANFIS 1 



AMFIS 1 structure resulting 




Fig. 2. The structure of the Ensembles de ANFIS 



This structure is divided into 5 sections, where the first section represents the 
data base to simulate in our Ensembles of ANFIS, which in our case is the data 
base of the Mackey-Glass time series [8] the second section training and valida- 
tion will be done sequentially in each ANFIS, where the number of ANFIS be 
trained can be from 1 to n depend on what the user wants to test, but in our case 
we are dealing with a set of 3 ANFIS in the Ensemble, in the third section we have 
to generate the results of each ANFIS trained in the previous section and the 
fourth section will integrate the overall results of each ANFIS, such integration 
will be done by direct average and weighted average is obtained and finally the 
outcome or the final prediction of the Ensembles ANFIS learning. 

The main idea of Ensemble learning in ANFIS, is that each ANFIS has differ- 
ent ways to train and thus to be simulated as if it was an expert system, i.e., give 
different points of view and then predict the time series, then take decisions or the 
results of each ANFIS and reach the same conclusion, then integrating the results 
and obtain the best prediction of the time series that is being simulated, to avoid 
future unexpected events. 
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3 Chaotic Time Series Prediction 

The problem of predicting future values of this time series has been a point of 
reference for many researchers. The aim is to use the values of the time series 
known at point x = t to predict the value of the series at some future point 
x = t + P. The standard method for this type of prediction is to create a mapping 
from D points spaced time series in A, is (x (t - (D - 1) A), ..., x (t - A), x (t)), to a 
predicted future value x it + P). To allow a comparison with previous results in 
this work [3], [9,10], [4] the values D = 4 and A = P = 6, were used. 

The used chaotic time series data used is defined as the Mackey-Glass time 
series (5), whose differential equation is defined as: 



dx(t) 0.2x (-t-r) 



dt 



1 + s? 10 (*-t) 



O.lx (f ) 



(7) 



For obtaining the values of the time series for each point, we applied the 
Runge-Kutta method for the solution of equation 7. The integration step was set at 
0.1, with initial condition x (0) = 1.2,x= 17, x (t) is then obtained for < t< 1200, 
which is illustrated in Fig. 3. (We assume x (t) = for t <0 in the integration.) 

Mackey-Glass Chaotic Time Series 





400 600 800 

Time (sec) 



Fig. 3. The chaotic time series 



From the Mackey-Glass time series we extracted 800 pairs of data points. 
Similar to [7], we predict x (t) from the three past values of the chaotic time series, 
that is, x (f-18), x (M2), and x (f-6). Therefore the format of the training data is 



[x(t - 18), x(t - 12), x(t - 6) ; x(t)] 



(8) 
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Where t = 19 to 818. The first 400 pairs of data are used to train the ANFIS, while 
the other 400 pairs of data are used to validate the model identification. The 
number of membership functions assigned to each entry of the ANFIS is fixed 
arbitrarily at 2, so that the number of rules is 8. Fig. 4. (a) shows the initial 
membership functions for each input variable and Fig 4. (b) illustrates the output 
membership functions after learning, but in our case the membership function type 
is varied. Note that the ANFIS used here contains a total of 50 parameters of 
adjustment, of which 32 are consequent (linear) parameters and 18 premise 
(nonlinear) parameters. 
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Fig. 4. (a) Initial membership functions before learning; (b) Final membership functions 
after learning 
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4 Methods of Integration 

There exists a diversity of methods of integration or aggregation of information, 
and we mention some of these methods below: 

Integration by average: this method is used in the Ensembles of ANFIS. This 
integration method is the simplest and most straightforward, consists en the sum 
of the results generated by each ANFIS is divided by the sum of number of ANFIS, 
and the disadvantage is that there are cases in which the prognosis is not good. 

Integration of weighted average: this method is an extension of the integration 
by average, with the main difference that the weighted average assigns importance 
weights to each of the ANFIS. These weights are assigned to a particular ANFIS 
based on several factors; the most important is the knowledge product experience. 
This integration method belongs to the well known aggregation operators. 

5 Simulation Results 

This section presents the results obtained through experiments on the structure of 
the ANFIS Ensemble, which show the performance that was obtained from each 
experiment to simulate the chaotic time series. Similarly, it is showing the results 
for each set of ANFIS that integrate our Ensembles of ANFIS. The results ob- 
tained by the integration by averages, and integration of weighted averages are 
shown in the last section. 

5.1 Ensemble 

In this Ensemble we have a set of 3 ANFIS learning, it is noteworthy that the type 
of membership functions the were assigned to each desired ANFIS were varied 
and the goal error was assigned to each ANFIS to 0.001 and the output of each 
ANFIS is evaluated by the following equation: 




RMSE " 



(9) 
5.1.1 ANFIS 1 

In this ANFIS 1 the triangular membership function was used, the prediction of 
output generated by the ANFIS 1 is shown in Fig. 5. (a), showing the trained data 
and the output of ANFIS 1, their differences are shown in Fig. 5. (b), which shows 
the prediction error between the desired data and data predicted by the ANFIS 1 . 
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(b) 
Fig. 5. (a) Desired (solid line) and Output (Dashed line) of ANFIS 1 ; (b) Prediction error 



5.1.2 ANFIS 2 



In this ANFIS 2 the sigmoid membership function was used, the prediction of out- 
put generated by the ANFIS 2 is shown in Fig. 6. (a), showing the trained data and 
the output of ANFIS 2, their differences are shown in Fig. 6. (b), which shows the 
prediction error between the desired data and data predicted by the ANFIS 2. 
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Dashes and Outputs of A WIS 2 
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Fig. 6. (a) Desired (solid line) and Output (Dashed line) of ANFIS 2; (b) Prediction error 



5.1.3 ANFIS 3 



In this ANFIS 3 the Il-shaped built-in membership function was used, the predic- 
tion of output generated by the ANFIS 3 is shown in Fig. 7. (a), showing the 
trained data and the output of ANFIS 3, their differences are shown in Fig. 7. (b), 
which shows the prediction error between the desired data and data predicted by 
the ANFIS 3. 



Chaotic Time Series Prediction Using Ensembles of ANFIS 



297 



D»red md OutjHi of JUfFIS 3 




(a) 



Pnddkn E litre ofWFIS 3 




(b) 
Fig. 7. (a) Desired (solid line) and Output (Dashed line) of ANFIS 3; (b) Prediction error 

5.2 Integration 

This section presents the results obtained by the methods de integration of average 
and weighted average, which one used for simulation in our experiments of the 
Ensembles of ANFIS in order to obtain a good forecast of the simulated time 
series. 



5.2.1 Integration by Average 

For obtaining the Ensemble average result of ANFIS, the integration method of 
average was implemented, where the results of each ANFIS are add and divided 
between the same numbers of ANFIS, this result is shown in Fig. 8. (a), which 
compares the actual data trained against the integrator settles for average, their dif- 
ferences are shown in Fig. 8. (b), which shows the prediction error between the 
desired data and data predicted by integrating on average. 
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Fig. 8. (a) Desired (solid line) and Output (Dashed line) of integrator by average; 
(b) Prediction error 



5.2.2 Integration of Weighted Average 

For obtaining the result in a more accurate Ensembles of ANFIS, in this case the 
method of integration by weighted average is implemented, which was given a 
weight depending on the results obtained from each ANFIS, these weights were 
assigned manually, where the lowest error among the ANFIS output is assigned a 
weight of 0.50, the ANFIS that had the second best average error was assigned 
0.30 and the ANFIS with biggest error was assigned a weight of 0.20, thus obtain- 
ing 100% of the weights assigned among the Ensembles ANFIS. Fig. 9. (a), shows 
the comparison between the training data and output data generated by the integra- 
tion by weighted average, their differences are shown in Fig. 9. (b), which shows 
the prediction error between the desired data and data predicted by integrative of 
weighted average. 
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(b) 

Fig. 9. (a) Desired (solid line) and Output (Dashed line) of integrator of weighted 
average; (b) Prediction error 



5.3 Results 

The results shown in Table 1 are the 5 best experiments obtained with the Ensem- 
bles of ANFIS, it is noteworthy that in each experiment we varied the type of 
membership functions and the goal error, thereby increasing the complexity to 
Ensemble ANFIS. 
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Table 1. Results of different Ensembles of ANFIS 



Ensemble 


ANFIS 


Error 

Goal 


Kind 

of 
MF 


RMSE 


Integra- 
tion 

by 

Average 


Integration 

of Weigh- 
ted 

Average 


1 


ANFIS 1 


0.01 


gbellmf 


0.0187 


0.0183 


0.0182 


ANFIS 2 


gaussmf 


0.0189 


ANFIS 3 


gauss2mf 


0.0179 


2 


ANFIS 1 


0.001 


trimf 


0.0202 


0.0174 


0.0173 


ANFIS 2 


dsigmf 


0.0186 


ANFIS 3 


pimf 


0.0176 


3 


ANFIS 1 


0.0001 


gauss2mf 


0.0181 


0.0176 


0.018 


ANFIS 2 


gbellmf 


0.0187 


ANFIS 3 


dsigmf 


0.0186 


4 


ANFIS 1 


0.00001 


gaussmf 


0.0189 


0.0183 


0.0182 


ANFIS 2 


gauss2mf 


0.0181 


ANFIS 3 


gbellmf 


0.0187 


5 


ANFIS 1 


0.000001 


trimf 


0.0202 


0.0178 


0.0175 


ANFIS 2 


gauss2mf 


0.0181 


ANFIS 3 


pimf 


0.0176 



6 Conclusions 



In conclusion we can say that the results obtained with the proposed Ensembles of 
ANFIS architecture have been good and positive in predicting time series (like the 
Mackey-Glass), as it has managed to minimize the prediction error of the time 
series against the results obtained by other researchers. 

These results obtained by our Ensembles of ANFIS architecture, managed to 
reach 98% of prediction, which offers on efficient outcome and avoids us prevent 
unexpected events in the future. 

As future work we will also consider the Mexican Stock Exchange and the 
Dow Jones time series, to verify that the proposed approach can provide good 
results and that they can be better than those obtained by other researchers. 
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Abstract. An intelligent virtual agent is an intelligent agent with a digital repre- 
sentation and endowed with conversational capabilities; this interactive character 
exhibits human-like behavior and communicates with humans or virtual interlocu- 
tors using verbal and nonverbal modalities such as gesture and speech. Building 
credible characters requires a multidisciplinary approach, the behaviors that must 
be reproduced are very complex and must be decoded by their interlocutors. In 
this paper we introduce a kinesics model in order to select the nonverbal expres- 
sion of intelligent virtual agents able to express their emotions and purposes using 
gestures and face expressions. Our model select the nonverbal expressions based 
on causal analysis, and data mining algorithms applied to perceptual studies in 
order to identify significant attributes in emotional face expressions. 

1 Introduction 

The intelligent virtual agents are increasingly present on the user interfaces be- 
cause they try to simulate the forms of human communication and thus to improve 
the human-machine interaction. Their design involve a multidisciplinary approach 
combining computer sciences, psychological, sociological and philosophical is- 
sues [1], the challenge is to endow them with humans characteristics like emotions 
and personality, and thus to allow the suspension of the disbelief. 

An intelligent virtual agent is an intelligent agent with a digital representation 
and endowed with conversational capabilities; when this interactive character ex- 
hibits human-like behavior and communicates with humans or virtual interlocutors 
using verbal and nonverbal modalities such as gesture and speech they are known 
as Embodied Conversational Agent (ECA) [2]. These kinds of agents try to trans- 
mit an illusion of life and reproduce the richness and the dynamics of the human 
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behavior [3]. Characters able to transmit this illusion could more easily produce an 
immersion's experience in a virtual world. They are being used in educative 
applications like virtual tutors, or as presenters or guides in visits to virtual muse- 
ums. Also they could be found as salesmen or real estate agents, personal trainers 
and therapists, among others applications. 

These agents become visible in the social interfaces like animated actors who 
represent human beings and are used to improve the interactivity with the users. In 
order to improve this interactivity, it is not enough that the virtual character looks 
real to be believable and catch the interest of the user. The expression of the inter- 
action must be given in a natural way, for that reason, the character must behave 
and communicate in the same way that a human do, using verbal and nonverbal ex- 
pressions. Gratch et al [1] identify that the face-to-face conversation, emotions and 
personality, and human figure animation are the key issues that must be addressed 
in creating virtual humans. 

Following these issues, the creation of virtual characters has evolved, starting 
with the creation of fantasy characters until virtual humans. The first projects de- 
veloped worked with a specific set of facial expressions from a corpus of facial ex- 
pressions product of the work conducted by Ekman and his colleagues[4]. These 
projects were talking faces trying to reproduce emotional expressions [5-7] and 
specific nonverbal functions in order to synchronize the verbal expression [8]. 
Other works express blend of emotions of different types and performative of the 
communicative act being performed [9]. 

The interest of our work is the selection of the facial expression of a virtual 
character that can express itself in a nonverbal way his emotional context coher- 
ence. The inherent difficulty to this task occurs because the human being can 
experience a variety of emotions, which are represented by a wide range of 
expressions, and are influenced by the traits of personality and the context. 

In this paper we introduce a kinesics model in order to model the nonverbal ex- 
pression of intelligent virtual agents able to express their emotions and purposes 
using gestures and face expressions. Our model is based on causal analysis, and 
data mining algorithms applied to perceptual studies in order to identify signifi- 
cant attributes in emotional face expressions and thus design a selection model of 
facial expressions. 

2 Expressing Emotional Behavior 

There are several ideas about what can give believability to virtual agents. For 
Badler [3], what makes a virtual human « human » is not just a well-executed ex- 
terior design but movements, reactions, and decision-making which appear « natu- 
ral », appropriate, and contextually-sensitive. To be able to produce this behavior, 
these life-like characters must communicate credibility in their emotional expres- 
sions (coherent reaction), exhibit personality and be able to express their indexical 
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and reflexive behavior, which are transmitted by verbal, paraverbal and non verbal 
activity, showed through facial, vocal and gestural expressions. 

In this work we used a behavioral model proposed by Morales [10] to handle 
the perceptions of the agent and determine its actions, on the basis of his emo- 
tional state and traits of personality. This model works with a series of attributes 
that define the emotion, mood, attitude, intention and objective of the agent. These 
elements models the behavior and are constantly updated, we characterized and 
used them for the selection of the emotional expression. 

The Fig. 1 shows the general architecture of a virtual character, where the out- 
put data of the behavioral model are taken by a Kinesics model in order to select 
and integrate the no verbal expression of the virtual character. 



Behavioral Model 

Intensity of the emotion, labels of the emotion, intention, 
attitude and mood 



Kinesics Model 



Facial Expression 
Selection Model 



c 



Gesture Expression 
Selection Model 

I 



Integration of Facial and 
Gestures Expressions 



Animation of Expressions 



Fig. 1. Components of a general architecture for the animation of a virtual agent 



An interesting detail to being mentioned is the way in that the behavioral model 
handles the emotional state of the agent. In order to handle and to represent the 
emotions of the agent a circumflex model is used, divided in eight regions by four 
axes that represent basics psychological concepts. According to this approach, an 
emotion is represented by a set of psychological characteristics that allow to 
define it and to classify it. The circumflex in Figure 2 shows the distribution of the 
top four recognized emotions: fear, anger, sadness and joy. 

The emotion label selection is based on its position in the space delimited by 
the four axes. For example, the emotion of anger is in the portion of space charac- 
terized by positive values of stress, arousal and stance, as well as a negative value 
of valence. 
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.Stress 



^■Arousal 




+ Valence 



Fig. 2. Classification of the emotions in four dimensions according to the proposed 
circumflex model in Morales [10] 

In this paper we detail the process related to the facial expression in the kine- 
sics model, we examined the existing relations between the face expressions and 
the attributes given by the behavioral model like intention, objective, emotion, 
mood and attitude, these relations are going to be used to determine the design of 
the selection model. 

The facial expressions are created by all the possible combinations of a total of 
5 eyebrows and 4 mouths shapes, giving a total of 20 different images (see Fig. 3). 
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Fig. 3. The twenty resulting faces of the combinations of the different eyebrows and mouth 
shape 



3 Characterization of Behavioral Attributes 



In order to analyses the existing relations between the face expressions and the at- 
tributes given by the behavioral model, we implemented a survey where the 
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participants characterized the set of twenty images using the attributes attitude, 
intention, mood and emotion of the behavioral model. 

The information generated, was used to develop a causal analysis to observe the 
weight of each attribute. We did the causal analysis using Tetrad 1 (Fisher's 
Z test, alpha value of 0.01) and we obtain that the attribute « mood » is the most 
significant of all (Fig. 4). 



Intention 




V., 


Attitude 


Mo 




ml 







Fig. 4. Causal analysis of the attributes that characterized the face expressions 

We also apply data mining using Naive-Bayes and ID3 classifiers to identify 
the best method to predict the expression that corresponds with the attributes of 
the behavioral model. The Fig. 5 shows the percentage of success of these classifi- 
ers, we note that the Naive-Bayes classifier was the best classifier with a 50% of 
success. 
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Fig. 5. Success percentage of classifiers 



Although the Naive-Bayes classifier has low performance, we can observe that 
the predicted images do not vary too much from the expected one (Table 1). 



1 In http: / /www. phil .cmu.edu/projects/tetrad it is possible to find more 
information and free download software of Tetrad Project. 
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Table 1. Comparison between the results obtained by the Naive-Bayes classifier with the 
image that we expected in the cases in that the classifier failed 



Expected 
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In order to distinguish the face expressions, we develop another survey that ask 
to the participants the identification of the emotion expressed by the face through 
the characterization of the four axes showed in the circumflex. With the new in- 
formation we can built a new circumflex, with the most popular answers of the 
survey (See Fig. 6). 



+ Stress 




+ Arousal 



+ Stance 



Pleasure 

\ 

+ Valence 



Fig. 6. New allocations of the emotional labels in the circumflex 
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This last survey, gives us the information to verify the interpretation of an 
emotion based on the four dimensions that characterize the circumflex. Now we 
are able to use the data of the emotional axes to identify an emotion and their in- 
tensity. This information has been used in the design of the selection model. 

4 Design of the Face Expression Selection Model 

The selection model of face expressions works with the information generate by 
the behavioral model and with the corpus product of the surveys. 

The model uses the corpus that has been classified by the Mood and by the four 
axes that characterizes an emotion. This corpus is processed in two layers in order 
to reduce it and treated by a Naive Bayes algorithm. 

The face expressions selection model can be appreciated in Figure 7 and their 
main algorithm in Table2. 

Behavioral Model 



Kinesics Model 

Facial Expression 
Selection Model 



Mood 



Data base of Face 
Expressions 

Characterization by 

attributes 



HOOD 
Fitter 



Subcatalogue 

of 
expressions 



Selection Method of Facial 
Expression 



Gesture Expression 
Selection Model 



\ 



Integration of Facial and Gestures Expressions 



Fig. 7. Structure of the Selection Model of Facial Expression 



The first layer of the model apply a mood filter, where we will choose only the 
images that were characterized with the current mood value of the behavioral 
model. 

We have chosen mood as the filter attribute, because of the causality tests. The 
result of this filter will give us a sub catalogue of expressions. 



310 



M.L. Morales-Rodriguez, F. Medellm-Martfnez, and JJ. Gonzalez B. 



Table 2. Algorithm of the selection model of face expressions 

Faria l_Expression_Model(Facial_Ex_DB, attribu tes ) 

1 . For each gesture € Facial_Ex _BD such that moodgesture = moodattributes 

2. add gesture into First_Candidates_ Facial_Ex 

3. end for each 

4. Final_ Facial_Ex <- ExpressionSelection ( Facial_Ex _DB , First_Candidates_Gestures, attributes ) 

5. return Final_ Facial_Ex 



This sub catalogue and the attributes of attitude, intention and emotion are used 
in the second layer : (see Fig. 8). 



Behavioral Model 



Emotion 
Intention 
Attitude 



Intensity 



Naive Bayes 
classifier 



Expressions 
candidates 



Facial Expression Selection Method 




Facial 

Expression 

Selection 



Fig. 8. Structure of the Selection Method of Facial Expression 

This second process selects from the database the face expression that better 
match to the label of the emotion, intensity and attitude. In Table 3 we observed 
the detail of the selection process. 



Table 3. Function ExpressionSelection 



ExpressionSelection (First_Candidates_ Facial_Ex, attributes) 



1 . Second_Candidates_ Facial_Ex 4r TopRankingNaiveBayes(First_Candidates_ Facial_Ex, attributes) 

2. Final_ Facial_Ex 4r FaceExpSelection(Second_Candidates_ Facial_Ex, attributes) 

3. return Final_ Facial_Ex 



This method calls a Naive Bayes Classifier to select the three better expressions 
(function TopRankingBayes, see Table 4), these face expressions could be 
variations of a same emotion affected by its intensity. 
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Table 4. Function that return the three Top Ranking of NaiveBayes 

TopRankingNaiveBayes (Facial _Ex, A ) 

1 . For each fe Facial_Ex 

2. Probabilitiesf <r 

3. For each a e A 

4. Probabilitiesf <- Probabilitiesf + ObtainProbability(a) 

5. end for each 

6. End for each 

7. Order Facial_Ex by Probabilites 

8. return Facial_Ex Q and Facial_Ex 1 and Facial_Ex 2 

The next step uses the intensity of the emotion to select between the group of 
three facial expressions with the highest Naive Bayes probabilities, see the 
function FaceExpSelection (Table 5). 

Table 5. Algorithm with the final selection 

FaceExpSelection (Facial _Ex _DB , Can didates, Facial_ Ex, A ) 

1. Intensity ir EmotionIntensity(ArausalA, StressA, StanceA, ValenceA, a) 

2. For each gesture e Candidates_Facial_Exgesture 

3. sum 4r 

4. For each emotion e Gestureemotions 

5. Probabilities 4r Probabilityemotion * Emotionintensity 

6. sum <- sum 4- Probabilities 

7. end for each 

8. Probability gesture 4r sum 

9. end for each 

10. return Max Probabilitygesture e Candidates_Facial_Exgesture 

The intensity of the emotions involved in the three better facial expressions is 
calculated by the operation described in Table 6. This calculus take the axes val- 
ues related with the definition of the emotion. The intensity is obtained from a 
sum of the values of the four axes, where the axes are going to have a different 
weight. The adjacent axes to the label of the emotion have a smaller weight than 
the nonadjacent axes. 

Table 6. Intensity calculation of each emotion label 

Emotion Label Inten sity Calcula tion 

Anger (a)[(-l)(Valence) + Stance]+(l-rx)[Stress + Arousal]) 

Surprise (a)[Valence + Stress]+(l-a)[Stance + Arousal] 

Joy (a)[Arousal + (-l)(Stress)]+(l-a)[ Stance + Valence] 

Pleasure (a)[(-l)(Arousal)+Stance]+(l-a)[(-l)(Stress)+ Arousal] 

Relaxation (a)[Valence +(-l)(Stance)]+(l-a)[(-l)(Stress)+(-l)(Arousal)] 

Sadness (a)[(-l)(Valence)+(-l)-(Stress)]+(l-a)[(-l)(Stance)+(-l)(Arousal)] 

Distress (a)[(-l)(Arousal) + Stress]+(l-a)[(-l)(Stance)+(-l)(Valence)] 

Fear (a)[Arousal +(-l)(Stance)]+(l-a)[Stress +(-l)(Valence)] 

5 Experimental Results 

In Table 7, we present a sequence of executions of the Behavioral Model and the 
experimental results obtained. 
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Table 7. Executions of the system 



Input 



Axes 



NaiveBayes Candidate Images 



Final 
Selection 



Mood: 

Negative 

Emotion: 

Fear 

Attitude: 

Directive 

Intention: 

Demand 



Stress : 

0.639 
Arousal: 
9.0 
Stance: 

-1.164 
Valence: 

-4.343 



' V',"»-\ 



Naive Bayes 

0.00291307382 
Intensity 

0.0523153323 



Naive Bayes 

0.001356211169 
Intensity 

0.013809930878 



NaiveBayes 

0.00120193359 
Intensity 

0.083002777777 




Mood: 

Negative 

Emotion: 

Fear 

Attitude: 

Directive 

Intention: 

Inform 



Stress: 

0.513 
Arousal : 
9.0 
Stance : 

-0.156 
Valence : 

-1.156 



NaiveBayes 

0.00145653691 
Intensity 

-0.032766633 



NaiveBayes 

0.000997146937 
Intensity 

0.014757436922 



Naive Bayes 

0.000801289062 
Intensity 

0.063547222222 




Mood: 

Neutral 

Emotion: 

Fear 

Attitude: 

Directive 

Intention: 

Demand 



Stress : 

0.41 

Arousal : 
9.0 

Stance : 
-0.125 
Valence : 
-0.978 




Naive Bayes 

0.0000000495 
Intensity 

-0.03043611111 






Mood: 

Neutral 

Emotion: 

Fear 

Attitude: 

Directive 

Intention: 

Reprimand 



Stress : 

0.363 
Arousal : 
9.0 

Stance : 
-0.123 
Valence : 
-0.907 



^ — -. 11 

55 



Naive Bayes 

0.00015470312 
Intensity 

-0.11520138888 




V> 



Naive Bayes 

0.000000090008 

Intensity 

-0.06280523702 




The table shows the inputs to the selection model, the three images with the 
best probabilities according to Naive Bayes classifier and the image of the final 
selection. Each image presents their probabilities and emotional intensity values. 
The final selection is the image with the maximum intensity. 

In these examples we observe that in some cases, the final image selected does 
not correspond with the image of the maximum probability of the Naive Bayes 
classifier, this could be influenced by the use of several inputs (attributes) with 
same weight in the classifier. The addition of the emotional intensity gave to the 
final selection a greater importance to the emotional state and define the final 
selection. This procedure follows the results of the causal analysis, where the 
emotion is second attribute with a greater weight. 

6 Conclusions 



In this paper we present a model of facial selection as part of a kinesics model for 
intelligent virtual agents able to express their emotions and purposes using 
gestures and face expressions. 
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The difficulty of this task occurs because the human being can experience a 
variety of emotions, which are represented by a wide range of expressions, which 
are influenced by the traits of personality and the context. Our model selects the 
face expressions influenced by a causal analysis of the attributes implied in the 
behavioral model of the virtual character and by a Naive Bayes Classifier. The se- 
lection model processing a face corpus according to the weight of these attributes 
and intensity. The model searches provide a contribution to the improvement of 
the virtual humans believability. 
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Abstract. The ability to understand the world around us is crucial to successfully 
act within an environment. For such a task it is important to understand relations 
and context between objects and actions. The goal is to have an agent capable of 
understanding its environment and by using knowledge of how it works to take ac- 
tions. The term environment is defined as physical places, virtual place and social 
situations an agent can encounter. Current agent architectures do not specify how 
knowledge should be handled from its environment or how to generate knowledge 
from experiences. This work focuses on using semantic networks to represent in- 
formation an agent acquires from its environment, allowing communication in a 
more meaningful way by transferring information from the semantic network. An 
agent is able to have an abstraction of the environment to the extent that it is capa- 
ble of interacting within it. Experiments with this method are done using a custom 
tool called Wiinik, this tool allows creating hierarchical agents using scripts as be- 
havior rules. Semantic networks, although not standardized, have great potential to 
model any kind of knowledge, regardless of an agent's capability. In this paper we 
describe how knowledge modeled as semantic networks is beneficial for agent de- 
cisions and facilitates the transfer of information between agents. Additionally, we 
describe the architecture of a software tool developed for social simulations and 
we discuss two case studies that were implemented using this software. 

Keywords: Semantic networks, Communication, multi-agent systems, rule engine, 
fuzzy systems. 



1 Introduction 

Humans are social beings by nature. We rely on others to perform specialized 
tasks within a community and each individual plays a role. Only the ones with the 
correct knowledge and ability can perform certain tasks adequately even though 
most members are physically capable of performing such tasks. The complexities 
of the interactions of any modern or primitive society go well beyond these 
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statements, but they do provide a good starting point for developing behavior 
models and software tools that would allow us to implement simple agents that are 
capable of interacting with each other within a limited environment. 

Acquiring and sharing information amongst a group is a vital part in the devel- 
opment of any society. Each discovery will lead a group closer to a common goal 
and thus ensuring the group's survival or well being. Being able to represent an 
agent's acquired knowledge in such a way that could be easily extended to different 
yet similar domains would greatly increase the ability to solve problems. Further- 
more, the ability to transmit knowledge between individuals requires certain abili- 
ties and a common knowledge of basic concepts that can be represented by sounds, 
written signs or gestures. Simulating these abilities in a limited environment pre- 
sents many challenges, amongst which are the general representation of any kind of 
knowledge, the extraction of relevant information from that knowledge and 
transmitting in a way that can be understood by any number of intended receivers. 

One of the reasons for developing Wiinik is to create a platform that would allow 
us to experiment with different behavior theories and knowledge representations that 
can ultimately result in agents capable of understanding their environment and use 
knowledge of how it works to take action. The ability to explore different possibilities 
helps us answer questions as to which approaches perform best in specialized situa- 
tions and in general situations. Ultimately being able to optimize general situation 
methods to specific situations would allow agents to adapt to changing environment 
and work with partial information. 

Probability theory has been used in agent-based modeling in order to model so- 
cial processes and dynamics [11]. These models are based on the interactions of 
individual entities, named "agents". These agents use simple behavioral rules, 
local to their simulated environment [12]. The use of Fuzzy Logic to model per- 
sonality in agents has previously shown satisfactory results [18]. Computational 
models such as AvatarSim, FLAME, and PETEEI use fuzzy logic to simulate 
emotions and personality in agents [5]. In Wiinik, an agent's behavior is deter- 
mined by a fuzzy inference system (FIS), which is designed to define the agent's 
psychological elements. Semantic networks are used to represent the information 
acquired by the agent from its environment, allowing the agent to communicate in 
a more meaningful way. Semantic networks, although not standardized, have great 
potential to model any kind of knowledge, regardless of an agent's capability. 

Another important factor for social developments and interactions is the sur- 
roundings. The surrounding environment determines what options are available 
and what goals becomes a priority, among other things. In order to work with a 
simplified model yet still have results that can relate to real situations we consider 
three types of environments: physical, virtual and social. A physical environment 
represents what an agent can perceive as input from the physical world. A virtual 
world is a combination of concepts that can be based on previous physical envi- 
ronment experiences or fictional experiences. Lastly, a social environment repre- 
sents the rules that determine what relation objects have to a current situation, 
what is appropriate to do, and other similar information. An agent is able to have 
an abstraction of the environment to the extent that it is capable of interacting 
within it. 
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Our work on the Wiinik platform has been focused on addressing all of these 
issues. We are able to work with multiple agents and environments interacting and 
behaving according to scripts based on logic rules. Behavior scripts have great 
flexibility that allows the implementation of different behavior patterns and 
mechanisms for decision-making. Similar work can be found for language proc- 
essing[13] and behavior modehng[10], since each is a monumental task on its 
own, few alternatives are available for experimenting with both. 

In the following sections, we present a brief description of important concepts 
used throughout this document, a description of the Wiinik platform is discussed 
and in the final sections we present a discussion of two case studies that have been 
implemented using Wiinik. 

2 Basic Definitions 

In this section basic definitions for semantic networks and agents are provided 
before describe how each concept contributes toward simulating human behavior. 

2.1 Semantic Networks 

A semantic network is a simple representation scheme that uses a graph of labeled 
nodes and labeled directed arcs to encode knowledge. Nodes represent either ob- 
jects, concepts or events and arcs represent relationships between nodes [16]. As 
an extension of directed graphs, semantic networks can be formally defined as 

G = (V,E). (1) 

where V is the set of nodes and E is the set of arcs denoting the different unions 
between nodes. Different types of semantic networks have been defined such as 
IS-A, conceptual graph and frames in [2,14,19]. Unfortunately one of the greatest 
strengths of semantic networks for representing knowledge is also one of its great- 
est weaknesses, almost any idea that is understood can be expressed in a semantic 
network but this greatly depends on the interpretation of the designer and there is 
no single correct answer to the description. This facilitates descriptions of objects 
and phenomena at the expense of complicating interpretation, data extraction and 
transmission. 

2.2 Definition of Agents 

An agent is defined as a software entity which functions continuously and auto- 
nomously in a particular environment, often inhabited by other agents and 
processes. A formal definition of an agent A is given as 

A g =(Sit,Act,Dat,f Ag ). ( 2 ) 

where: 
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1 . Sit is a set of situations an agent can be in, 

2. Act is a set of actions that can be performed by an agent, 

3. Dot is a set of values that an agent internal memory can have, and 

4. f A is the behavior function defined as f A : Sit x Dat — > Act depends on the 
current situation and the current memory values to determine the next action 
[6]. 

The main focus of this work is on the element Dat , this component will consist of 
a semantic network containing all the knowledge an agent has acquired. Relevant 
information from this set is extracted as needed either to decide future actions and 
to exchange ideas with other agents. For this work every agent needs to have an 
understanding of basic concepts that are defined within Dat . The amount of in- 
formation that this set can hold depends greatly on the application or experiment 
that an agent is designed for. 

The Act set limits an agent to perform only actions that are defined within this 
set. Changing this set would have significant effects on f A results and thus 

changing the experiences that are lived within an environment. This parameter is 
useful in order to simulate different types of beings or same beings with slightly 
different physical abilities. 

f A is defined as a behavior rule script given to each agent. As knowledge is 

acquired and new methods are discovered, an agent can add new rules or modify 
old rules in order to adapt to changing circumstances. This also allows for agents 
that were identical at one point to behave differently with time. 

3 Description of Wiinik Tool 

Wiinik is a software tool developed to help model human behavior, interactions 
and reasoning in an effort to imitate as closely as possible the constraints that exist 
in real life as best as they are understood. 

This software allows for many possibilities by combining several technologies, 
such as Multi-Agent Systems (MAS), rule-based programming, fuzzy logic, and 
more. 

Wiinik follows the agent model proposed by [4], where each agent is made up 
of one Persona agent and one or more cognition agents. In section 3.1 we will de- 
scribe this type of agent and others that we used. This allows for the Transactional 
Analysis theory [1,20] to be easily implemented and experimented on. Each agent 
may have a unique composition giving greater diversity on the types of situations 
and theories that can be simulated. 

3.1 Definition of a MAS 

Wiinik makes use of the open source platform Java Agent Development frame- 
work (JADE) [3] that handles all of the MAS operations. These include communi- 
cation and mobility among others. JADE was chosen as it is fully implemented in 
Java, compliance with the Foundation for Intelligent Physical Agents (FIPA) stan- 
dards [7], and its role as middle- ware allows quick and organized development. 
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3.2 Rule Engine 

The rule engine is a very important approach of this project's design goal. Making 
use of such a tool allows for an agent to be easily customized without affecting 
any other and can even change its own code during the simulation. 

Jade Expert Shell System (JESS) [8] is what was chosen for this project. Be- 
cause it is implemented in Java, it can be easily integrated with the JADE plat- 
form. JESS is used as the scripting language to handle the knowledge base and 
interaction rules for each agent. When an agent is created it is given a name and a 
JESS script to control its behavior. 

3.3 Agent Mechanics 

Each agent is designed to resemble a real person as closely as possible, thus its 
behaviors are designed to represent physical actions instead of fixed instructions 
to perform. Fig. 1 shows a diagram of an internal agent structure and interaction. 
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Fig. 1. Internal agent structure 

An agent action set Act from tuple 2 is defined as Act = {Say , Listen} for 
message passing only. But, if we include all possible actions then we have 



Act = {Say, Listen,Think, Move) . 



(3) 



where: 



1. Say behavior. This behavior is scheduled when an agent needs to communi- 
cate. It will arrange the message in a proper format and send it to the 
corresponding receiver. 
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2. Listen behavior. This behavior is responsible for receiving any incoming 
messages and either resend to all cognition agents or add message to knowl- 
edge base. Only messages coming from other Persona agents are processed in 
this way. 

3. Think behavior. Whenever there is new input to process, this behavior will al- 
low the rule engine to execute and if there is any output it will process it 
accordingly. 

4. Move behavior. Represents the ability of an agent to move between different 
environments. 

Figure 1 shows an Ego State behavior; here all cognition agent messages are re- 
ceived by this behavior. Once all agents have responded to an event, it takes the 
message with greater value and creates a Say behavior so that it can be sent. Fi- 
nally, rule engine component is what gives an agent the ability to process the input 
collected from the environment and output actions to be performed or generates 
new knowledge based on information already obtained. 

Definition of Persona Agent. Persona agents are named after the Latin word 
"Persona" which translates to mask. The purpose of this type of agents is to man- 
age all the actions that are sent by Cognition agents and choose which to perform. 
It consists three main behaviors: Listen, Ego State, and Say. Received messages 
are sent to all grouped cognition agents and their output is received by the Ego 
State behavior for evaluation, once a decision is made, the agent takes the 
appropriate action. 

Definition of Cognition Agent. The Cognition agent allows the agent as a whole 
to think and act. Even though its architecture is very similar to a Persona agent, 
there are a few key differences. The knowledge base receives input when a mes- 
sage is received from the Persona agent or when it is informed of a new agent be- 
ing created. The rule engine is activated for every new input. If there is any output, 
a Say behavior will be assigned to inform the Persona agent and have it evaluate 
the decision. 

Definition of Environment Agent. When dealing with agent interaction within an 
environment, there is a need to keep track of some values. This agent can handle 
any information necessary such as keeping track of the population, weather and 
traffic among others and can be consulted. 

3.4 Message Structure 

A message M is formally defined as a 7-tuple that holds information that allows a 
deeper interpretation of messages beyond the literal meaning of the contents. 
M holds information that is based on perceptions and can be integrated into the 
knowledge base adding context information for any given event, we have: 

M = (Id,T,P,C,BPNM,A,Ag RS ). (4) 

where: 
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1. Id is a unique identification value used to associate a message with an event 

allowing an agent to interact without any confusion when processing many 
concurrent events. 

2. T is the topic of the conversation. It is used to identify the context in which a 

message is used so that it can be interpreted accordingly. 

3. P is the performative, it states the reason, purpose or intention of the message. 

4. C is the content of the message. This element can be a simple text message or 

formatted in such a way that in can be directly integrated into an agent's 
knowledge base and processed as data with behavior rules. The content can 
also be formatted as a new behavior rule to be added and thus expand an 
agent's ability to act upon new data or correct behavior patterns. 

5. Behavior Profile Node Message BPNM = {A,ES i ,ES u ,7C i ,n u ) . The BPNM 

element is proposed by [9], where: 

1. /l = [0..1]. A. represents the perception an agent has over a specific 
event. It is a single real value between and 1 inclusive calculated using 
fuzzy logic. 

2. ES = [Parent, Adult, Child] . ES t and ES U represent the ego state an 

agent is currently in and perceives respectively. The way an agent acts, 
what goals it decides to pursue and what information will be relevant for 
decisions depends greatly on this value. 

3. 7V = [0..l]. 7t i and n u represent the psychological posture an agent has 
and perceives respectively. It is a single real value between and 1 
inclusive calculated using fuzzy logic. 

6. A is used to identify what messages have been processed. This item helps iden- 

tify old messages allowing agents to keep a record of previous experiences. 

7. Ag RS identifies the sender or intended receiver of the message. 

3.5 Behavior Scripts 

Agent actions are modeled through behavior scripts that are in the form of logic 
rules. JESS is used as the rule engine and is the main component for decision- 
making. JESS scripts can be given to several types of agents for different purposes 
not limited to controlling actions, they may also facilitate tasks such as agent crea- 
tion. Fig. 2 shows an example on how to create four agents using JESS scripts and 
Fig. 3 shows part of what would be an agent's behavior script. 

It is possible to specify more than one script for each Persona agent, this will 
create a cognition agent for each script and each will receive whatever input is 
meant for his or her persona agent. For actions with agents with more than one 
cognition agent, each resulting action is weighted against every other resulting 
action and the most urgent action is chosen. 

4 Study Cases 

Each of the following cases of study focuses on certain features of the Wiinik 
platform that allow us to simulate a variety of scenarios. 
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(assert (Load (Type TimerAgent) 
(Name "kronos"))) 

(assert (Load (Type Person Agent) 
(Name "agent 1") 
(Script ./S cripts/Don Jua ti/Do n Jua n . c lp))) 

(assert (Load (Type Person Agent) 
(Name "agent2") 
(Script ./S cripts/DonJu an/Victim. c lp))) 

(assert (Load (Type Person A gent) 
(Name "agcnt3") 
(Script ./S eripts/Don J uan/ Victim . clp) ) ) 



Fig. 2. Sample script for loading four agents 



(defrute receive- job "receive job" 
?h<-(Hear 

(attended ?a&:( ?a FALSE)) 

(id ?msgld) 

(performative "REQUBST") 

(topic "Work") 

(content ?job) 

(sender ?id)) 
(me (aid ?aid) (bpNode ?bp)) 
=> 

(bind ?jj (assert- string ?job)) 
(modify ?jj (MID ?id)) 
(printout t "Asserting job " ?job crlf)) 



Fig. 3. Sample behavior rule 



4.1 Case 1. "Don Juan" Syndrome 

This case of study is based on the pathology described in [15] and on previous 
work described in [9]. The agents are modeled to follow the typical behavior de- 
scribed in the Don Juan syndrome only when their behavior profiles match and 
they are in the correct ego state. The behavior scripts also implemented the con- 
cept of cathexis [17] which allows the ability to switch between priorities in a 
stable manner. 

The first step in this scenario involves a greeting; this allows each agent to ex- 
change information on how each perceives the other. Based on this information an 
agent will decide how to act toward each other and decides how to proceed with 
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the communication. If an agent is not suitable for one's current needs then no 
further message exchange is done, otherwise the script will continue its course. 
The information used to decide this behavior is contained within the Behavior 
Profile Node Message (BPNM). 





Agent 1 



Agent2 





Greet 


Greet 




"~l Analyze Victim 

Ask rf in a relationship 


fl&m to relationship 


Demand 




Respond demand 


^1 Analyze interest 


End relationship 



I Analyze 
-*A Rescuer 



Fig. 4. Don Juan syndrome case study script flow 



If the agent acting as a Don Juan should decide that the recently greeted agent 
is suitable and is not currently already in a relationship then the next step is to 
propose a relationship to this agent. The second agent would receive the message 
and decide whether or not the first agent is perceived as a savior and if that is what 
is desired at the moment. 

When the agents agree to be in a relationship a series of messages are ex- 
changed in witch the one playing the role of the victim will start demanding things 
from the Don Juan agent, this increase in commitment will reduce the interest the 
Don Juan agent has for the victim and will shortly end with the relationship forc- 
ing the victim to hate the Don Juan agent and this agent will go on to wait for 
another agent to start a relationship with. 

During this scripted behavior many internal values have changed, mainly in the 
perception that each has for each other and themselves that allow this sort of be- 
havior to continue. Any number of things could alter the course and may result in 
the script not being followed to completion if there are any alterations caused by 
external or internal influences. 
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Figure 4 shows an uninterrupted exchange of messages between a Don Juan 
agent and another agent that takes the role of victim. 

4.2 Case 2. A Company's Workers 

When working toward a common goal, it is important for each member to play 
their role and to have shared knowledge on important concepts. The success or 
failure will be decided by the group effort and not by individual achievements. 
This case of study demonstrates how key information can be standardized among 
the group yet given slightly different interpretations or give priorities to different 
parts of the information. 

In this case of study, a kind of agent called Manager must first have a record of 
how many agents type Employees are available to work and how capable they are 
for the job (this is measured by how tired an employee agent is). Each Employee 
agent will contact its Manager agent as soon as possible with that information. 

Once a manager knows how many employees are available, it can then send a 
request message to the environment asking for work. The Environment agent will 
reply with job specifications and await confirmation. Depending on the size of the 
job and how many employees are available for it, a manager must either accept or 
decline this job. If the job is declined then the manager will lose credibility and 
will receive smaller jobs, this is preferred to risking not completing the job on 
time. 

If a job is accepted then a manager will divide the work and give each em- 
ployee a fair share, not necessarily equal shares. Since a manager agent has know- 
ledge on each employee status, it may choose to give consideration to how capable 
or tired each agent is, or disregard this information and assign work to tired em- 
ployees agents. This will cause tired agents to produce many errors and the job 
may take longer to complete. 

When agents are assigned work they will perform their task as best as their cur- 
rent level allows them to. Each employee can have different priorities, some may 
be workaholics, and some may prefer to rest if tired or have different levels for 
deciding when they are tired. If an agent is not within a certain level of health then 
they will have a high probability of producing errors when work is assigned to 
them, and may choose not to work for some time until they consider themselves fit 
for work. Once they finish their workload they report to the manager. 

When a manager receives enough finished tasks to complete the job then this 
information is communicated to the environment agent and it is rated depending 
on the job being completed on time. The cycle then repeats with the manager re- 
questing another job, again considering whether or not the employees are fit to 
complete it or not. 

Figure 5 depicts a simplified case consisting of only one agent of each type. 
Each manager agent is to acquire jobs and decide whether it is capable of finishing 
on time with the given amount of workers. On the other hand, worker agents will 
prioritize their well being before job completion. An environment agent will only 
care whether an accepted job was done on time and how often a job is refused. 
Each agent is a representation of the knowledge that is passed and how each views 
a job in different ways. 
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Fig. 5. Script flow of the company workers study case 



5 Conclusions and Future Work 



Semantic networks can provide richer information when communicating than 
simple instructions given as code or strings. Important additional information can 
be associated with each message and each agent will evaluate the meaning giving 
priority to its very specific needs. This type of information opens the way for dif- 
ferent interpretations and exploring how messages can become unclear in given 
situations or when there is a lack of information needed to interpret the meaning. 

Wiinik provides a flexible platform for experimenting with different behavior 
models and types of interaction. The JADE software provides powerful tools for 
inspection message passing and the JESS rule engine provides useful instructions 
for modeling behaviors. 

Future work will focus on implementing advanced techniques for decisions and 
learning such as different types of neural network architectures and fuzzy systems. 



326 I. Espinoza-Hernandez et al. 

Acknowledgements 

The authors would like to thank CONACYT. The student Ivan Espinoza-Hernandez 
is supported by a scholarship from CONACYT. 



References 

1. Berne, E.: Principles of Group Treatment. Oxford University Press, New York (1964) 

2. Brachman, R.J.: What is-a is and isn't: An analysis of taxonomic links in semantic 
networks. Computer 16(10), 30-36 (1983) 

3. Caire, G., Bellifemine, F.L., Greenwood, D.: Developing Multi-Agent Systems with 
JADE. Wiley, Chichester (2007) 

4. Castanon-Puga, M., Rodriguez-Diaz, A., et al.: Social Systems Simulation Person 
Modeling as Systemic Constructivist Approach. In: Castillo, O., Melin, P., Kacprzyk, 
J., Pedrycz, W. (eds.) Soft Computing for Hybrid Intelligent Systems, vol. 154. 
Springer, Heidelberg (2007) 

5. El-Nasr Seif, M., Yen, J.: Agents, Emotional Intelligence and Fuzzy Logic. In: 
Proceedings of NAFIPS, Florida, US (1998) 

6. Denzinger, J., Hamdan, J.: Improving observation based modeling of other agents us- 
ing tentative stereotyping and compactication through kd-tree structuring. WIAS 4(3), 
255-270 (1999) 

7. FIPA. FIPA Abstract Architecture Specification (2002) 

8. Friedman-Hill, E.J.: Jess, The Java Expert System Shell. Distributed Computing 
Systems, Sandia National Laboratories, Livermore, CA (2000) 

9. Flores, D.L., Rodriguez-Diaz, A., Castro, J.R., Gaxiola, C: TA-Fuzzy Semantic Net- 
works for Interaction Representation in Social Simulation. In: Evolutionary Design of 
Intelligent Systems. SCI, pp. 257-270, 213-225 (2009) 

10. Ghasem-Aghae, N, Kaedi, M., Oren, T.I.: Effects of Cognitive Complexity in Agent 
Simulation: Fuzzy Rules and an Implementation. In: Proceedings of: CM&SC - Con- 
ceptual Modeling and Simulation Conference 2005, within I3M 2005 - International 
Mediterranean Multiconference, Marseille, France, SCS, San Diego, CA, October 
20-22 (2005) 

11. Gilbert, N., Troitzsch, K.G.: Simulation for the Social Scientist. Open University 
Press, Buckingham (1999) 

12. Gorman, D.M., Mezic, J., Mezic, I., Gruenewald, P.J.: Agent-based modeling of drink- 
ing behavior: a preliminary model and potential applications to theory and practice. 
Am. J. Public Health. 96, 2055-2060 (2006) 

13. Majumdar, A., Sowa, J.F., Stewart, J.: Pursuing the goal of language understanding. 
In: Eklund, P., Haemmerle, O. (eds.) ICCS 2008. LNCS (LNAI), vol. 5113, pp. 21^42. 
Springer, Heidelberg (2008) 

14. Minsky, M.: A framework for representing knowledge. Technical report, Cambridge, 
MA, USA (1974) 

15. Novellino, M.: The Don Juan Syndrome: The Script of the Great Losing Lover. 
Transactional Analysis Journal 36(1), 3343 (2006) 

16. Quillian, M.R.: Semantic Memory. MIT Press, Cambridge (1968) 



Agent Communication Using Semantic Networks 327 

17. Rodriguez-Diaz, A., Cristobal-Salas, A., Castanon-Puga, M., Jauregui, C, Gonzalez, 
C: Personality and Behaviour Modelling Based on Cathexis Flux. In: Proceedings of 
the Joint Symposium on Virtual Social Agents AISB 2005: Social Intelligence and 
Interaction in Animals, Robots and Agents, ED. AISB The Society for the Study of Ar- 
tificial Intelligence and the Simulation of Behaviour UH, pp. 130-136. The University 
of Hertfordshire (2005) 

18. Sharma, S., Singh, H., Prakash, A.: Multi-agent modeling and simulation of human 
behavior in aircraft evacuations. IEEE Transactions on Intelligent Transportation 
Systems 44(4), 1477-1488 (2008) 

19. Sowa, J.F.: Conceptual structures: information processing in mind and machine. 
Addison-Wesley Longman Publishing Co., Inc., Boston (1984) 

20. Woollams, S., Brown, M.: TA: the total handbook of transactional analysis. Prentice 
Hall, London (1979) 



Fuzzy Cellular Model for Predator-Prey 
Interaction Applied to the Control of Plagues in a 
Peppers Cropping 

Cecilia Leal-Ramirez 1 , Oscar Castillo 2 , and Antonio Rodriguez-Diaz 1 

1 Universidad Autonoma de Baja California, Tijuana, BC, Mexico 

cecilialr@uabc.mx, ardiaz@uabc.mx 

2 Instituto Teconlogico de Tijuana, Mexico 

ocastillo@haf samx. org 



Abstract. The control of plagues is the regulation and the handling of some species 
referred to as plagues, normally these are species that affect the ecology and the 
economy of a certain location. The search for solutions to the important economic 
incidence of the plagues in the croppings has had an evolution throughout the last 
two decades. In the studies of population dynamics of plagues in croppings, it is 
fundamental to consider the predator-prey systems. We formulated a fuzzy cellular 
model for modeling predator-prey interactions and to study the control of plagues in 
a croppings of pepper affected by Frankliniella species. We consider as predators for 
the biological control of these plagues two species to the Orius species. Our model 
can be used to calculate the introduction of natural enemies (predators) for fight- 
ing against the injurious species (preys). The results show that the proposed model 
works as a tool to achieve plagues control. The combination of predator and prey 
numbers allows us to stabilize the pepper density level on a suitable bio-economic 
equilibrium level. 

1 Introduction 

The control of plagues is the regulation and the handling of some species referred 
as plagues, normally for being species that affect the ecology, the economy, etc. The 
control of plagues is important because the quality of the product and its value in 
the market depend on them. Two typical procedures for the control of plagues exist 
at the moment. The first is the chemical control that uses insecticides that are highly 
effective although the risk exists of contaminating the product with substances 
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detrimental for the human health. The second is the biological control, which 
consists of the use of beneficial insects for the plants, as natural predators. The 
product that is obtained using biological control has a higher value in the market. 
One of the first models to incorporate interactions between predators and prey was 
proposed by [12] and [22]. This model is based on differential equations, and actu- 
ally, the general models, of the predator-prey type, are modifications or extensions 
from these equations. In the present work, we formulated a fuzzy cellular model of 
the predator-prey type to study the control of a plague on a pepper croppings, which 
is defined using the differential equations that characterize the model proposed by 
1 1 2 ] and [22], but expressed in terms of a fuzzy cellular structure plus terms of emi- 
gration and immigration for the predator. In the study of multispecies problems, nor- 
mally one of the most important goals is to find the conditions for the bio-economic 
equilibrium of the resource in order to maximize the revenues earned from them 
maintaining the ecological balance amongst the species [18]. In our study the goal 
is maintaining the pepper cropping yield on a bioeconomically suitable equilibrium 
level combining the predators and preys density in the population dynamics. The 
equilibrium level is defined as the acceptable peppers density after of the damage to 
the cropping produced by the plague during a time interval. The data on the pepper 
croppings were taken from [6] and [3], and the change coefficients that govern the 
predator-prey interaction were taken from [16]. Our interest is in obtaining a char- 
acterization of the fuzzy cellular predator-prey model using fuzzy logic systems 
based on fuzzy sets ll23ll to evaluate the parameters defined as change coefficients 
in the model. This characterization allows us to evaluate the effectiveness of depre- 
dation produced by the calculation of the introduction of natural enemies (Orius 
species) for fighting against the injurious species (Frankliniella species). The basic 
concepts of fuzzy set theory are presented in section 2. The fuzzy cellular predator- 
prey model structure is formulated step by step in section 3. In section 4 a bravely 
description on the predator and prey species which affect to the pepper cropping is 
made. In section 5, the fuzzy logic system is developed using the Matlab fuzzy logic 
toolbox. The results generated by several simulations are shown in section 6, where 
our model shows how ecology, economics and mathematics can be combined into a 
fuzzy cellular predator-prey model to identify management strategies that will make 
the work of plagues control more efficiently done with respect to the models defined 
by using complex differentials equations. Finally, in section 7 the conclusions are 
presented. 

2 Basic Concepts of Fuzzy Sets 

Fuzzy sets are considered elements that have degrees of membership for belonging 
to the set. Fuzzy sets were introduced by [ 23 ] as an extension of the classical notion 
of a set. In classical set theory, the membership of elements in a set is assessed in 
binary terms according to a bivalent condition. By contrast, fuzzy set theory allows 
the gradual assessment of the membership of elements in a set; this is described with 
the aid of a membership function valued in the real unit interval [0, 1]. If X is a set 
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of objects denoted by x and A is a fuzzy set in X, then A is defined as an ordered 
pairs set 

A(x)={(x,Ha)\x&X}, (1) 

where A(x), is called membership function for the fuzzy set A. The construction 
of a fuzzy set depends on the identification of an appropriate universe and the 
specification of a membership function with appropriate linguistic meaning ll23ll . 
Fuzzy set theory can be used in a wide range of domains in which information is 
incomplete or imprecise [8]. In the following section, we will use the fuzzy sets to 
define the change coefficients of the multispecies fuzzy cellular model and to study 
its dynamics. 

3 Fuzzy Cellular Predator-Prey Structure 

The mathematical models that describe predator and prey relationships are used to 
study interactions between two species, when one of them depends on the other for 
food and for survival. Such dynamic relationships between predators and preys are 
prominent areas of study in Ecology 0, ITTIl . The model proposed by 1 12] and [22] 
is the simplest model of interactions between predator and prey, and it is expressed 
as a differential equation system: 

m = rH-aHP 

§=bHP-mP' {) 

where H and P represent the number of preys and of predators respectively, at time 
t , and r — intrinsic rate of preys population increase in the absence of predation, 
a = predation rate, b — reproduction rate of predators, m = predator mortality rate 
in the absence of preys. In the present study, the constants a,b,r and m are called 
'change coefficients" of the classical predator-prey model. Model (2) considers that 
in the absence of predators the number of preys grows exponentially and also that 
in the absence of preys, the number of predators decreases exponentially. The terms 
(—aHP) and (bHP) describe the prey-predator encounters, which are favourable to 
predators and fatal to preys. It assumed that the change in the population density is 
mainly given by the difference of two related terms with the birth and the death of 
the individuals, and that both terms are proportional to the population density |[T3ll 
and to the amount of resources available [21]. 

In general, the model behaviour is unnatural showing no asymptotic stability. 
However, numerous modifications of this model exist, which make it more realis- 
tic and are used today in the analysis of population dynamics. A representation of 
model (2) as a deterministic model is given by 

H(t + At) = H(t) + (r-aP(t))H(t) 

P(t + At) = P(t) + (bH(t) - m)P(t) ' K ' 
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In models (2) and (3) it is frequent to assume that all individuals of the 
population interact with the same probability independently of the abundance, fre- 
quency or space-position of individuals of a particular space. This assumption can 
be more or less justified for concrete cases in animal populations, but this is at 
all unjustified in plant populations due to the competition process [26]. Models in 
space have been proposed, and could be the more significant advance of the recent 
Ecology to understand the role of the local process into population space-temporal 
organization [26]. 

Now we proceed to introduce a space-temporal representation into the equations 
(3) using a cellular structure. This structure is composed by a rectangular region 
of M x N cells. The union of all the cells defines the cellular space in which the 
evolution of each cell depends on its present state and the states of its immedi- 
ate neighboring cells. Let C be a single cell, then C(i,j) is the cell centred at the 
(i, j) coordinates, where 1 < i < M and 1 < j < N, Each cell is a fundamental ele- 
ment for the population density change, reason why the contribution of all the cells 
determines the change of the total population density in the system's dynamics. 
Therefore, modifying expression (3) according to the cellular structure definition, 
we obtain 

Hij(t + At) = Hij(t) + (r- aPij^Hijit) 

Pij(t + At) = Pij(t) + (bH u (t) - m)P u (t) ■ k ' 

A population may increase rapidly from a low level under favorable environmental 
conditions, but this increase in numbers will eventually approach the level where 
resources cannot support the continue increase, this is, indefinite increases in popu- 
lation density do not occur [20]. 

If we take into account that the imprecision and uncertainty are intrinsic concepts 
in bio-systems [8], then we can define the change coefficients, under which the 
populations fluctuate, as functions that depend on two variables characterized as 
fuzzy sets, 



Hij(t + At) = H t j(t) + (r(#y(*) A'W) - a(£y(0,^(0)W)%(0 

Pij(t + At) = Pij(t) + (b(flij(t),Pij(t)) H ij(t) ~ »(#y(0.^(0))fy(0 ' 

where Hjj(t) and Pj(t) are the prey and predator densities respectively located in 
cell (i,j) at time t. 

The prey and predator look for optimal environment conditions, and even though 
all the conditions are not the optimal ones, preys and predators will always try to be 
located into environments where each of these conditions is within tolerable limits 
that allow their normal development. Let n a (i,j) be the neighborhood of the cell 
C(i, j) in terms of a radius a 



H = \c(k,i) 

a I 



max{\k-i\,\l-j\} < a K 
l<k<M;l<l<N, ' ' 
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where a is a positive integer number and (k,l) are the coordinates of another cell 
where the magnitude of the difference between (i,k) and (j,l) does not exceed the 
value of a. The population that emigrates towards the neighboring cells from a cell 
C(i,j) is determined by a factor that is directly proportional to the population size. 
Taking in account that in studies on control of plagues on croppings, the predator 
will emigrate to find preys, and the prey will always have unlimited resource to sur- 
vival, now we retake the equations (5) and (6) to define the emigration considering 
that only the predators can emigrate, 

Hij(t + At) = Hij(t) + rifltjWAiWHift) - aiHijWiPijityPiji^Hijit) 
Pij(t + At) = P,j(t) + b{6 tJ {t),Pi)(f))H i} (t)Pij(t) - m{Hij{t)Aj{t))Pij{t) (7) 
-j3(£y(0.^(0)fy(0+l£(*,*.0 

where Hn(t) > 0, Pij{t) > 0, /3(-) is the emigration function of the predators and 
Yla{k,l,t) is the predators population that immigrates from any neighboring cell 
C(k,l) of the cell C(i,j) to the cell C(z', j). Thus, we have formulated a fuzzy cellular 
model of the predator-prey type (7). 

4 Description of the Predator and Prey Species 

Works on plagues in croppings under plastic have been made as application of the 
predator-prey models [15]. For example, authors in [6l analyzed from the mathe- 
matical point of view, the sufficient conditions for the existence of global attractors, 
for the solutions to the possible differential systems (predator-prey) on two crop- 
pings, pepper and tomato. These croppings are generally affected by three groups of 
plague species of great economic severity: "heliotis of the tomato", "white fly" and 
"trips western of the flowers". Predators for the biological control of these plagues 
species are the Orius and Macrolophus species [6]. In addition, they made adjust- 
ments with their proposed model and analyzed if the adjustment was good by means 
of statistical coefficients. 

We have taken as a case study, the control of the plague known as the 
Frankliniella species commonly denominated "trips western of the flowers" us- 
ing the Orius predator species, in peppers cropping made under the method pro- 
posed by |6|. We assume that the interaction between predator and prey is governed 
by the change coefficients obtained by [16] and not by the ones obtained by (6l 
(see Table I). 



Table 1. Change coefficients values obtained from [ 16 1 



T 


a 


b 


m 


0.012 


0.0003 


0.2 


0.01 
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5 Fuzzy Logic System 

Fuzzy logic was proposed as a generalization of classic logic (23J, |p25ll . and is used 
to model vagueness and uncertainty in the real world. Fuzzy logic systems (FLS) 
have four basic components: an input processor, a fuzzy rules base, a fuzzy inference 
mechanism and an output processor (Fig. 1). 



Fuzzyfi cation 



fuzzy set 



Inference 
Mcchinc 



Dofuzzyficaation 



fuzzy set 



Fig. 1. Basic components of a fuzzy logic system 



In the FLS, the input and output variables have fuzzy sets as values with partial 
overlap, which create qualitative groups of values. Through the linguistic variables 
it is possible to define new fuzzy sets based on the existing sets using the adjectives 
"much", "little", "very", etc [24]. The fuzzy rules have the IF-THEN form, these 
connect hypothesis to conclusions [25 1- The FLS initiates the computation by map- 
ping the input values to the fuzzy sets that characterize the input variables. These 
fuzzy sets are used as input to the inference mechanism through the fuzzy rules. 
These are combined in the inference mechanism to produce a fuzzy output, which 
is mapped to a numerical value by means of a defuzzification process to produce an 
output value. A particular type of fuzzy inference is the Mamdani method. This type 
of inference combines the membership degrees associated with each input value by 
the minimum operator and aggregates the rules through the maximum operator fl4l . 
We now proceed to introduce, step by step, the development of a fuzzy logic system 
to evaluate the change coefficients, using the Matlab fuzzy logic toolbox, with the 
fuzzy inference given by the Mamdani method, and the center of area method for 
the defuzzification. 

It is possible to interpret that the environment of a species is variable by defin- 
ing the change coefficients as fuzzy [ 10]. In our model the predator depends on the 
prey for survival. Taking in to account the model definition (7), we developed one 
fuzzy logic system to evaluate its change coefficients. The system consists of two 
input variables, five output variables and 9 fuzzy rules. The input variables corre- 
spond to the prey and predator densities located in the cell (i,j) at time t, which are 
characterized by three states: Few, Moderate and Much (Fig. 2 and Fig. 3). 
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Fig. 2. Membership functions characterizing the input variable Frankliniella 
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Fig. 3. Membership functions characterizing the input variable Orius 
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VeryHigh 



0.18 0.185 0.19 0.195 0.2 0.205 0.21 0.215 0.22 
OMR 



Fig. 4. Membership functions characterizing the output variable OMR (Orius Mortality Rate) 



There is no defined pattern to determine the membership functions that charac- 
terize the input variables. Therefore, we used the patterns more appropriate for char- 
acterization of the input and output variables. The output variables are the change 
coefficients expressed in model (7). These variables are characterized by five states: 
VeryLow, Low, Moderate, High, VeryHigh (Figs. 5-8). 



VeryHigh 




xlO 



Fig. 5. Membership functions characterizing the output variable FMR (Frankliniella 
Mortality Rate 
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VeryHigh 




0.008 0.0085 0.009 0.0095 0.01 0.0105 0.011 0.0115 0.012 
ONR 



Fig. 6. Membership functions characterizing the output variable ONR (Orius Natality Rate) 



VeryHigh 




0.01 0.0105 0.011 0.0115 0.012 0.0125 0.013 0.0135 0.014 
FNR 



Fig. 7. Membership functions characterizing the output variable FNR (Frankliniella Natality 
Rate) 



The membership functions domain can be changed according to the desired 
behavior. We constructed a total of 9 fuzzy rules to describe the natural behavior 
generated by the interaction between prey and predator when one of them depends 
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VeryHigh 



0.005 0.01 0.015 0.02 0.025 0.03 

OER 



Fig. 8. Membership functions characterizing the output variable OER (Orius Emigration 
Rate) 



on the other for survival. The fuzzy rules set was constructed from the interpretation 
on the life cycle of both species, and are expressed as follows, 

1. If (Frankliniella is Few) and {Orius is Few) then (OMR is VeryHigh) (FMR is 
VeryLow)(ONR is VeryLow)(FNR is VeryHigh)(OER is Normal). 

2. If (Frankliniella is Few) and (Orius is Moderate) then (OMR is VeryHigh)(FMR 
is Low)(ONR is VeryLow)(FNR is High)(OER is High). 

3. If (Frankliniella is Few) and (Orius is Much) then (OMR is VeryHigh)(FMR is 
Normal)(ONR is VeryLow)(FNR is Normal)(OER is VeryHigh). 

4. If (Frankliniella is Moderate) and (Orius is Few) then (OMR is High)(FMR is 
Low)(ONR is Low)(FNR is High)(OER is Low). 

5. If (Frankliniella is Moderate) and (Orius is Moderate) then (OMR is Nor- 
mal)(FMR is Normal)(ONR is Normal)(FNR is Normal)(OER is Normal). 

6. If (Frankliniella is Moderate) and (Orius is Much) then (OMR is Low)(FMR is 
High)(ONR is High)(FNR is Low)(OER is High). 

7. If (Frankliniella is Much) and (Orius is Few) then (OMR is Normal)(FMR is 
High)(ONR is VeryHigh)(FNR is Low)(OER is VeryLow). 

8. If (Frankliniella is Much) and (Orius is Moderate) then (OMR is Low)(FMR is 
VeryHigh)(ONR is Normal)(FNR is VeryLow)(OER is Low). 

9. If (Frankliniella is Much) and (Orius is Much) then (OMR is VeryLow)(FMR 
is VeryHigh)(ONR is VeryLow)(FNR is VeryLow)(OER is Normal). 

Where FNR = r, FMR = a, OMR = b, ONR = m, OER = j3 . 
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6 Simulation Results 

In the work of [6] the experimental data were obtained from a pepper cropping 
with area of 2000m 2 , which we represent by cells in the present work. Each cell 
represents 20m 2 of the total area, and there are two pepper plants on lm 2 . Therefore, 
there are 40 pepper plants on 20m 2 and 4000 pepper plants on 2000m 2 . Our goal is 
studding the damage of the pepper generated by the plague (Frankliniella) using 
model (7) and the method of [6] under the hypothesis that the interaction between 
Orius-Frankliniella is governed by the change coefficients shown in Table 1 . 

Model (7) describes the dynamics of the biological system in which two species 
interact, the predator and the prey (see Fig. 9 and Fig. 10). 
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Fig. 9. Prey and predator densities generated by the model (7) 



Fig. 9 shows the changes in the prey and predator densities, while in Fig. 10 we 
show the closed lines formed by the prey and predator densities, which demonstrate 
that the model (7) has no asymptotic stability like model (2), it does not converge to 
an attractor. Nevertheless, in nature, the populations tend to the regulation or asymp- 
totic stabilization [ 7 ] . To control the plagues in croppings it is not necessary to know 
the interaction between prey and predator indefinitely, because the croppings have 
a time interval wherein the resource grows until its harvest. The harvest period in 
a production cycle under a greenhouse is from 4 to 7 months |3|. Therefore, in the 
present study, we consider only a time interval of [1 — 30] weeks to generate the 
interaction between the species Orius and Frankliniella, using model (7). Although 
this model has no asymptotic stability, the interaction between prey and predator 
will determine the damage level produced by the prey density on the plant in each 
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Fig. 10. Closed lines formed by the prey and predator densities generated by the model (7) 



cell. We expect that the damage on the cropping tends to the asymptotic stabil- 
ity converging towards an attractor, which should be on a suitable bioeconomically 
equilibrium Q. The bioeconomical equilibrium is defined as the cropping produc- 
tion acceptable after of the damage generated by the plagues on a time interval. 
Hypothetically we have considered the damage caused by a prey of 1 x 10~ 6 % on 
one plant, per week. 

The results are presented as averages obtained by 20 simulations, and these are 
classified in 4 case studies, each one determines a region on the total area wherein an 
initial number of preys is distributed, simulating the contaminated area by plagues 
(Tables 2, 3, 4 and 5), and the initial number of predators is distributed on the total. 



Table 2. Damage produced by the preys on the cropping of pepper plants, considering the 
10% on the total area as the contaminated region by preys at the beginning 



Frankliniella \ Orius 



20 x Cell 



30 x Cell 



40 x Cell 



50 x Cell 



60 x Cell 



70 x Cell 



[0 - 20] x Cell 
[10-20] x Cell 
[30-40]x Cell 



0.00216 
0.00342 
0.00474 



0.00222 
0.00294 
0.00384 



0.00191 
0.00254 
0.00323 



0.00136 
0.00217 
0.00279 



0.00135 
0.00184 
0.00242 



0.00106 
0.00155 
0.00207 



We only will present in the graphics the two more representatives combinations 
per each Table, to show how the interaction produced by preys and predators, deter- 
mines the asymptotic growth of the damage caused by the preys on the plants per 
cell. 
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Fig. 11 and Fig. 13 show the interactions between preys and predator generated 
by model (7) using the combination of the prey and predator initial number shown 
in Table 2. The Fig. 12 and Fig. 14 show the damage caused by the preys on the 
plants per cell, which is generated by Fig. 1 1 and Fig. 13 respectively. 
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Fig. 11. Prey and predator densities generated by the model (7) using the combination of 20 
Orius per cell and from 30 to 40 Frankliniella per cell. The preys were initially distributed on 
10% of the total area (see Table 2). 



xio 




Fig. 12. Damage caused by the preys on the plants per cell. The preys were initially distributed 
on 10% of the total area, using the combination of 20 Orius per cell and from 30 to 40 
Frankliniella per cell (see Table 2). 
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Fig. 13. Prey and predator densities generated by the model (7) using the combination of 70 
Orius per cell and from 30 to 40 Frankliniella per cell. The preys were initially distributed on 
10% of the total area (see Table 2). 
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Fig. 14. Damage caused by the preys on the plants per cell. The preys were initially distributed 
on 10% of the total area, using the combination of 70 Orius per cell and from 30 to 40 
Frankliniella per cell (see Table 2). 
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Fig. 15. Prey and predator densities generated by the model (7) using the combination of 20 
Orius per cell and from 30 to 40 Frankliniella per cell. The preys were initially distributed on 
20% of the total area (see Table 3). 



Table 3. Damage produced by the preys on the cropping of pepper plants, considering the 
20% on the total area as the contaminated region by preys at the beginning 



Frankliniella \ Orius 



20 x Cell 



30 x Cell 



40 x Cell 



50 x Cell 



60 x Cell 



70 x Cell 



[0 - 20] x Cell 
[10-20]x Cell 
[30-40]x Cell 



0.00515 
0.00696 
0.00984 



0.00446 
0.00619 
0.00774 



0.00355 
0.00491 
0.00642 



0.00272 
0.00427 
0.00551 



0.00222 
0.00373 
0.00479 



0.00241 
0.00322 
0.00421 



Fig. 15 and Fig. 17 show the interactions between preys and predator generated 
by model (7) using the combination of the prey and predator initial number shown 
in Table 3. Fig. 16 and Fig. 18 show the damage caused by the preys on the plants 
per cell, which is generated by Fig. 15 and Fig. 17 respectively. 



Table 4. Damage produced by the preys on the cropping of pepper plants, considering the 
30% on the total area as the contaminated region by preys at the beginning 



Frankliniella \ Orius 



20 x Cell 



30 x Cell 



40 x Cell 



50 x Cell 



60 x Cell 



70 x Cell 



[0 - 20] x Cell 
[10-20]x Cell 
[30-40]x Cell 



0.00777 
0.01053 
0.01427 



0.00679 
0.00912 
0.01165 



0.00526 
0.00738 
0.00965 



0.00446 
0.00619 
0.00828 



0.00361 
0.00557 
0.00720 



0.00359 
0.00482 
0.00631 
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Fig. 16. Damage caused by the preys on the plants per cell. The preys were initially distributed 
on 20% of the total area, using the combination of 20 Orius per cell and from 30 to 40 
Frankliniella per cell (see Table 3). 
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Fig. 17. Prey and predator densities generated by the model (7) using the combination of 70 
Orius per cell and from 30 to 40 Frankliniella per cell. The preys were initially distributed on 
20% of the total area (see Table 3). 
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Fig. 18. Damage caused by the preys on the plants per cell. The preys were initially distributed 
on 20% of the total area, using the combination of 70 Orius per cell and from 30 to 40 
Frankliniella per cell (see Table 3). 
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Fig. 19. Prey and predator densities generated by the model (7) using the combination of 20 
Orius per cell and from 30 to 40 Frankliniella per cell. The preys were initially distributed on 
30% of the total area (see Table 4). 



346 C. Leal-Ramirez, O. Castillo, and A. Rodriguez-Diaz 



0.015 



0.01 



Q 0.005 




Fig. 20. Damage caused by the preys on the plants per cell. The preys were initially distributed 
on 30% of the total area, using the combination of 20 Orius per cell and from 30 to 40 
Frankliniella per cell (see Table 4). 
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Fig. 21. Prey and predator densities generated by the model (7) using the combination of 70 
Orius per cell and from 30 to 40 Frankliniella per cell. The preys were initially distributed on 
30% of the total area (see Table 4). 



Fig. 19 and Fig. 21 show the interactions between preys and predator generated 
by model (7) using the combination of the prey and predator initial number shown 
in Table 3. Fig. 20 and Fig. 22 show the damage caused by the preys on the plants 
per cell, which is generated by Fig. 19 and Fig. 20 respectively. 
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Fig. 22. Damage caused by the preys on the plants per cell. The preys were initially distributed 
on 30% of the total area, using the combination of 70 Orius per cell and from 30 to 40 
Frankliniella per cell (see Table 4). 




Fig. 23. Prey and predator densities generated by the model (7) using the combination of 20 
Orius per cell and from 30 to 40 Frankliniella per cell. The preys were initially distributed on 
100% of the total area (see Table 5). 



Fig. 23 and Fig. 25 show the interactions between preys and predator generated 
by model (7) using the combination of the prey and predator initial number shown 
in Table 3. Fig. 24 and Fig. 26 show the damage caused by the preys on the plants 
per cell, which is generated by Fig. 23 and Fig. 25 respectively. 
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Fig. 24. Damage caused by the preys on the plants per cell. The preys were initially distributed 
on 100% of the total area, using the combination of 20 Orius per cell and from 30 to 40 
Frankliniella per cell (see Table 5). 
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Fig. 25. Prey and predator densities generated by the model (7) using the combination of 70 
Orius per cell and from 30 to 40 Frankliniella per cell. The preys were initially distributed on 
100% of the total area (see Table 5). 



Inside certain limits, the increment in the population's density causes decreases 
in the yield for the plants (TJ, Q9), H, El, Qj]. In the work presented by Q, the 
authors made a study on the yield of a cropping combining the population's density 
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Fig. 26. Damage caused by the preys on the plants per cell. The preys were initially distributed 
on 100% of the total area, using the combination of 70 Orius per cell and from 30 to 40 
Frankliniella per cell (see Table 5). 



per m 2 , the early pruning of the terminal buds and the length of harvest period. Their 
results showed that the cropping total yield is of 6.27 peppers per plant with no 
pruning. According to this result, we present the cropping total yield associated to 
the area initially affected by the plague and considering that the yield of a plant is of 
8 peppers (see Tables 2-5). We would expect to have a total yield of 32000 peppers 
in case of that the cropping does not have plague during 30 weeks. Nevertheless, all 
the croppings tend to have plagues. Therefore the croppings must be protected of 
plagues using methods as the biological control to maximize their yield. 



Table 5. Damage produced by the preys on the cropping of pepper plants, considering the 
100% on the total area as the contaminated region by preys at the beginning 



Frankliniella \ Orius 



20 x Cell 



30 x Cell 



40 x Cell 



50 x Cell 



60 x Cell 



70 x Cell 



[0 - 20] x Cell 
[10-20]x Cell 
[30-40]x Cell 



0.0240 
0.0354 
0.0478 



0.0214 
0.0299 
0.0389 



0.0183 
0.0254 
0.0324 



0.0141 
0.0218 
0.0278 



0.0111 
0.0185 

0.0243 



0.0121 
0.0160 
0.0212 



The agriculture under greenhouse conditions requires producing croppings of 
commercial high value and that are economically portable. Therefore its objective 
is to maximize the yield of the croppings, which implies that the yield of crop- 
ping must stay on an equilibrium level economically portable. The equilibrium level 
is defined as the acceptable peppers density after of the damage produced by the 
plague during a time interval. For the case study presented in this work, we could 
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determine the suitable procedure to control the plague on the desired yield of the 
Table 6. For example, if we want to have a yield above 80% then we must take into 
account from 20 to 70 predators to fight the dispersed prey on 10% of the total area 
(see Table 6). For the case of 20% affected by plague, we must consider more than 
70 predators to fight the plague. Thus, we will be able to maintain the cropping yield 
on an equilibrium level of 80%. 

Table 6. Cropping yield according to the area affected by the plague 



Area(%) 


Orius 


Frankliniella 


Damage Area (%) 


Harvest Peppers (%) 


10 


20 


[30-40] 


0.47 


99.52 


10 


70 


[30-40] 


0.21 


99.79 


20 


20 


[30-40] 


0.99 


99.01 


20 


70 


[30-40] 


0.43 


99.57 


30 


20 


[30-40] 


1.43 


98.57 


30 


70 


[30-40] 


0.64 


99.36 


100 


20 


[30-40] 


4.78 


95.22 


100 


70 


[30-40] 


2.12 


97.88 



7 Conclusion 

We have modeled the predator-prey dynamics without using explicit complex differ- 
ential equations applied to control of plagues. Our model calculates the introduction 
of natural enemies (predators) for fighting against the injurious species (preys). We 
considered the change coefficients from [16] to define the predator-prey interaction. 
The results show that the proposed model works as a tool to plagues control. The 
combination of predator and prey numbers allows us to stabilize the resource level 
on a suitable bio-economic equilibrium level. The use of the model to evaluate the 
effectiveness of predation remains to be developed. Nevertheless, which is observed, 
at the moment is that the present model can be useful in this evaluation, that consti- 
tutes a problem in all the processes of biological control. We consider that the use 
of fuzzy logic is a great contribution to the construction of models as the one pro- 
posed in the present work. Fuzzy theory allows modeling many forms of biological 
organization in a more realistic way [9]. In addition, the input and output sets of the 
fuzzy systems can be easily constructed with the help of specialists in the field, that 
is, a specialist will know when the population of a particular species is small, large, 
and so forth. 
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Abstract. This chapter introduces copula functions and the use of the Gaus- 
sian copula function to model probabilistic dependencies in supervised classi- 
fication tasks. A copula is a distribution function with the implicit capacity to 
model non linear dependencies via concordance measures, such as Kendall's 
r. Hence, this chapter studies the performance of a simple probabilistic classi- 
fier based on the Gaussian copula function. Without additional preprocessing 
of the source data, a supervised pixel classifier is tested with a 50-images 
benchmark; the experiments show this simple classifier has an excellent 
performance. 

Keywords: Gaussian copula, supervised classification. 

1 Introduction 

In Pattern Recognition applications many algorithms and models have been 
proposed for many tasks, specially for clustering, regression and classification. 
Applications in which a training data set with categories and attributes is 
available and the goal is to assign a new object to one of a finite number 
of discrete categories are known as supervised classification problems [21 1121 
115) . In this work we present the use of the Gaussian copula function as an 
alternative for modeling dependence structure in a supervised probabilistic 
classifier. 

Copula functions are suitable tools in statistics for modeling multiple de- 
pendence, not necessarily linear dependence, in several random variables. For 
this reason, copula functions have been widely used in economics and finance 
IU [23 HB] . More recently copula function have been used in other fields 
such as climate [22], oceanography [6], hydrology JO], geodesy pQ, reliability 
|17j . evolutionary computation [20, 21 and engineering [11 j . By using copula 
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theory, a joint distribution can be built with a copula function and, possibly, 
several different marginal distributions. Copula theory has been used also for 
modeling multivariate distributions in unsupervised learning problems such 
as image segmentation [H [8] and retrieval tasks [16l EU [24] • In [13] , the bi- 
variate copula functions Ali-Mikhail-Haq, Clayton, Frank and Gumbel are 
used for unsupervised classification. These copulas are well defined for two 
variables but when extended to three or more variables several complications 
arise (for instance, undefined copula parameters) , preventing their generaliza- 
tion and applicability. For the Gaussian copula however, there exist a simple 
"general formula" for any number of variables. This work introduces the use 
of Gaussian copula in supervised classification, and compares an independent 
probabilistic classifier with a copula-based probabilistic classifier. 

The content of the chapter is the following: Sectionals a short introduction 
to copula functions, Section [3] presents a copula based probabilistic model for 
classification. Section [4] presents the experimental setting to classify an image 
database, and Section [5] summarizes the conclusions. 

2 Copula Functions 

The copula concept was introduced 50 years ago by Sklar [53] to separate 
the effect of dependence from the effect of marginal distributions in a joint 
distribution. Although copula functions can model linear and nonlinear de- 
pendencies, they have been barely used in computer science applications 
where nonlinear dependencies are common and need to be represented. 

Definition 1. A copula C is a joint distribution function of standard uniform 
random variables. That is, 

C{ui ,...,Ud) = P(Ui <ui,...,U d <Ud) , 

where Ui ~ E/(0, 1) for i = 1, . . . , d. 

For a more formal definition of copula functions, the reader is referred to 
[TUfTS]. The following result, known as Sklar's theorem, states how a copula 
function is related to a joint distribution function. 

Theorem 1 (Sklar's theorem). Let F be a d-dimensional distribution 
function with marginals F\, F 2 , ■ • ■ , Fd, then there exists a copula C such that 
for all x in R , 

F(xi,x 2 ,.--,Xd) = C(Fi(xi),F 2 (x 2 ), ■ • • ,F d (x d )) , 

where R denotes the extended real line [— 00, 00]. If F\(x\), F 2 (x 2 ), . . . , Fd{xd) 
are all continuous, then C is unique. Otherwise, C is uniquely determined on 
Ran(Fi) x Ran(F 2 ) x • ■ • x Ran(Fd), where Ran stands for the range. 
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According to Theorem [TJ any joint distribution function F with continuous 
marginals F\, F 2l ■ ■ . , F d has associated a copula function C. Moreover, the 
associated copula C is a function of the marginal distributions F\, F2, . . . , F<j. 
An important consequence of Theorem Q] is that the d-dimensional joint 
density / and the marginal densities fi, f 2 , ■ ■ ■ , fd are also related: 

d 
f(x u ...,x d )=c(F 1 (x 1 ),...,F d {x d ))-Y[f i (x i ) , (1) 

i=i 

where c is the density of the copula C. The Equation ([1]) shows that the 
product of marginal densities and a copula density builds a d-dimensional 
joint density. Notice that the dependence structure is given by the copula 
function and the marginal densities can be of different distributions. This 
contrasts with the usual way to construct multivariate distributions, which 
suffers from the restriction that the marginals are usually of the same type. 
The separation between marginal distributions and a dependence structure 
explains the modeling flexibility given by copula functions. 

2.1 Gaussian Copula Function 

There are several parametric families of copula functions, such as Student's 
t copula and Archimedean copulas. One of these families is the Gaussian 
copula function. 

Definition 2. The copula associated to the joint standard Gaussian 
distribution is called Gaussian copula. 

According to Definition [2] and TheoremQ] if the d-dimensional distribution of 
a random vector {Z\. . . . , Z d ) is a joint standard Gaussian distribution, then 
the associated Gaussian copula has the following expression: 



c($(z 1 ),...Mza);£)= I •••/ ^ (J^/^J 1/a **"*i 



* -1 («i) ri>-\u d ) j-t'E-H 



or equivalently, 



C(u u ...,u d ;S) = J^ ...J_^ ———.to... & , 

where <P is the cumulative distribution function of the marginal standard 
Gaussian distribution and £ is a symmetric matrix with main diagonal of 
ones. The elements outside the main diagonal of matrix S are the pairwise 
correlations pij between variables Zi and Zj, for i, j — l,...,d and i ^ j. 
It can be noticed that a d-dimensional standard Gaussian distribution has 
mean vector zero and a correlation matrix S with d(d — l)/2 parameters. 
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Algorithm 1 Pseudocode for estimating parameters 

1: for each random variable Xi, i = 1, . . . ,d, estimate its marginal distribution 
function F using the observed values Xi. The marginal distribution function 
can be parametric or nonparametric 

2: determine Ui = Fi(xi), for i — 1, . . . , d 

3: calculate z% — <&~ 1 (ui) where $ is the cumulative standard Gaussian distribu- 
tion function, for i — 1, . . . , d 

4: estimate the correlation matrix E for the random vector (Z\,. . . ,Zd) using 
pseudo observations (z\ , . . . , Zj) 



The dependence parameters pij of a d-dimensional Gaussian copula can 
be estimated using the maximum likelihood method. To do so, we follow the 
steps of Algorithm [TJ 

Due to Equation JT]), the d-dimensional Gaussian copula density can be 
calculated as: 



z'£- 



c(*(zi),...,$(z d y,z) = 



(27r)< d / 2 )|i:| 1 /2 



_I Z 2 

-e 2 i 



v\ d i 

1 g-K^ 1 --^ . (2) 



\E\V 2 



Given that a Gaussian copula is a distribution function it is possible to simu- 
late data from it. The main steps are the following: once a correlation matrix 
E is specified, a data set can be generated from a joint standard Gaussian 
distribution. The next step consists of transforming this data set using the 
cumulative distribution function <P. For random vectors with a Gaussian cop- 
ula associated to their joint distribution, the first step is to generate data 
from the copula and then determining their quantiles by means of their cu- 
mulative distribution functions. Algorithm fj] and FiguresQ],[2]and[3]illustrate 
the sampling procedure for different correlations. 



Algorithm 2. Pseudocode for generating data with Gaussian dependence 

structure 

1: simulate observations (zi, ... , Zd) from a joint standard Gaussian distribution 

with matrix correlation E 
2: calculate Ui = #(«j) where $ is the cumulative standard Gaussian distribution 

function, for i = 1, . . . , d 
3: determine Xi using quasi-inverse F~ (ui), where Fi is a cumulative distribution 
function, for i — 1, . . . , d 



Figure [U-(a) shows 500 bivariate data with correlation p = —0.5 drawn 
from a bivariate standard Gaussian distribution (step 1, Algorithm [2|) . The 
histogram on the vertical axis and the histogram on the horizontal axis 
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Fig. 1. (a) A sample of 500 points from a standard Gaussian distribution with 
parameter p — —0.50. (b) The corresponding sample for a Gaussian copula, (c) 
The associated sample for a joint distribution with marginal Beta distributions with 
parameters (1,2) (histogram on the horizontal axis) and (0.5,0.5) (histogram on 
the vertical axis) . (d) The associated sample for a joint distribution with marginal 
t-Student distribution with 8 degrees of freedom (histogram on the horizontal axis) 
and marginal exponential distribution with mean 4 (histogram on the vertical axis). 



illustrate that both marginals arc univariate standard Gaussian distribu- 
tions. This data set is used to obtain a sample from a Gaussian copula, as 
shown in Figure [U-(b) (step 2, Algorithm [2]). Both histograms illustrate that 
marginals are uniform, according to Definition [TJ Figured- (c) shows a sample 
from a joint distribution with Gaussian copula and Beta marginals (step 3, 
Algorithm [5]) . This sample is obtained using the data set of Figure [TJ-(b). 
Figure [U(d) shows a sample from a joint distribution with Gaussian cop- 
ula, Student's t marginal distribution and exponential marginal distribution 
(step 3, Algorithm [2|) . This sample is also obtained from the data set of 
Figure Q~|-(b). In order to appreciate how the correlation parameter modifies 
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Fig. 2. (a) A sample of 500 points from a standard Gaussian distribution with 
parameter p — —0.70. (b) The corresponding sample for a Gaussian copula, (c) 
The associated sample for a joint distribution with marginal Beta distributions with 
parameters (1,2) (histogram on the horizontal axis) and (0.5,0.5) (histogram on 
the vertical axis) . (d) The associated sample for a joint distribution with marginal 
t-Student distribution with 8 degrees of freedom (histogram on the horizontal axis) 
and marginal exponential distribution with mean 4 (histogram on the vertical axis). 



the dependence structure, Figures [5] and [3] show the same information as 
Figure Q] with p = —0.7 and p = —0.95, respectively. 

Although the correlation is used to generate data from a Gaussian copula, 
it is not necessary the same for joint distributions with Gaussian copula 
and non-Gaussian marginals. However, the data sets in Figure [T] have the 
same concordance value measured in Kendall's r. This important result 
for parametric bivariate copulas (see [IB]) is explained through the equation: 



t(X u X 2 )=4 f f 
Jo Jo 



C(ui,u 2 ; 6)dC(ui,u 2 ; 9) — I 



(3) 



Using Gaussian Copulas in Supervised Probabilistic Classification 361 







(a) 



(I)) 







(c) 



(d) 



Fig. 3. (a) A sample of 500 points from a standard Gaussian distribution with 
parameter p — —0.95. (b) The corresponding sample for a Gaussian copula, (c) 
The associated sample for a joint distribution with marginal Beta distributions with 
parameters (1,2) (histogram on the horizontal axis) and (0.5,0.5) (histogram on 
the vertical axis) . (d) The associated sample for a joint distribution with marginal 
t-Student distribution with 8 degrees of freedom (histogram on the horizontal axis) 
and marginal exponential distribution with mean 4 (histogram on the vertical axis). 



which relates the dependence parameter 9 of a copula and Kendall's r. For 
a bivariate Gaussian copula, Equation ([3]) can be written as 



t = — arcsin(,o) 

n 



(4) 



Given that is well established how to estimate correlation matrixes, eval- 
uate densities, and calculate integrals for the multidimensional Gaussian 
distribution, the Gaussian copula function is relatively easy to implement. 
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3 The Probabilistic Classifier 

As noted, the aim of this work is to introduce the use of Gaussian copula 
functions in supervised classification. According to Thcorem[TJ we can employ 
a copula function in a probabilistic classifier, such as a Bayessian classifier. 
In this section we present a three dimensional probabilistic model based on 
three empirical distribution functions and a trivariate dimensional Gaussian 
copula function. 

The Bayes' theorem states the following: 

f|t .>|..,) = f ' B °"^7° t ' l ,5) 

where P(K = k\E = e) is the posterior probability, P(E = e\K = k) is the 
likelihood function, P(K = k) is the prior probability and P(E = e) is the 
data probability. 

The Equation ([5]) has been used as a tool in supervised classification. A 
probabilistic classifier can be designed comparing the posterior probability 
that an object belongs to class K given its attributes E. The object is then 
assigned to the class with the highest posterior probability. For practical rea- 
sons, the data probability P(E) does not need to be evaluated for comparing 
posterior probabilities. Furthermore, the prior probability P{K) can be sub- 
stituted by an uniform distribution if the user does not have an informative 
distribution. 

3.1 The Probabilistic Classifier Based on Gaussian Copula 
Function 

For continuous attributes, a Gaussian copula function can be used for mod- 
eling the dependence structure in the likelihood function. In this case, the 
Bayes' theorem can be written as: 

p , K = fc|p) = c(Fi( ei ), . . . ,F n (e n )|fc, g) x Uti /iN*Q x PjK = k) 

/(ei,...,e„) 

(6) 
where Fi and fi are the marginal distribution functions and the marginal 
densities of attributes, respectively. The function c is a d-dimensional Gaus- 
sian copula density defined by Equation ©. As can be seen in Equation ©, 
each category determines a likelihood function. 

3.2 The Probabilistic Classifier Based on Independent Model 

By considering conditional independence among the attributes in Equation 
([6|) , or equivalently, an independent structure in the likelihood function given 
a category, a probabilistic classifier can use the following expression in order 
to calculate posterior probabilities: 
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P(K = k\e) = ^f { ^^ = k) ■ W 

Equation (JT)) uses an independent structure given by a copula density equals 
to one. This independent copula density can be also obtained by a Gaussian 
copula function when matrix S is the identity matrix /. 

3.3 An Application Example 

Consider the following specific classification problem: assign a pixel to a cer- 
tain class according to its color attributes. If we have information about the 
color distribution of each class, then we can use this information and the 
Bayes' theorem in order to classify new pixels. This is an example of super- 
vised classification. For a red-green-blue (RGB) color space and two classes, 
a Gaussian copula based classifier can be written as 



p , fcl ,, _ c(F R (r),F G (g),F B (b)\k,E)f R (r\k)f G {g\k)f B (b\k) x P(k) 

( |r ' 5 ' )- W^J) ' 

(8) 
where c is a trivariate Gaussian copula density. 

In order to classify a pixel, we use in Equation ([5]) a prior probability 
P{K — k) based on the uniform distribution, nonparametric marginal den- 
sities / based on histograms to approximate fn{r\k), fa(g\k) and /s(6|fc), 
and nonparametric marginal distributions F based on empirical cumulative 
distribution functions to approximate F R {r) 1 Fc(g) and FbQ>). For modeling 
the dependence structure of the likelihood function f(r, g, b\k) we present the 
use of a trivariate Gaussian copula function. 



4 Experiments 

We use two probabilistic models in order to classify pixels of 50 test im- 
ages. The first model is an independent probabilistic model (I-M) based on 
the product of marginal distributions. The second model is a copula-based 
model (GC-M) that takes into account a dependence structure by means of a 
trivariate Gaussian copula. The image database was used in [5] and is avail- 
able online [37] • This image database provides information about two classes: 
the foreground and the background. The training data and the test data are 
contained in the labelling-lasso files [2TJ , whereas the correct classification is 
contained in the segmentation files. Figures EJ [5] and |6] show the description of 
three images from the database. Table [3] shows a description for each image. 
Although the database is used for segmentation purposes, the aim of this 
work is to introduce the use of the Gaussian copula function in supervised 
color pixel classification. We use the information for supervised color pixel 
classification, without taking into account the spatial information. 
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Fig. 4. (a) The color image, (b) The labelling-lasso image with the training 
data for background (dark gray), for foreground (white) and the test data (gray). 

(c) The correct classification with foreground (white) and background (black). 

(d) Classification made by I-M. (e) Classification made by GC-M. 




Fig. 5. (a) The color image, (b) The labelling-lasso image with the training data 
for background (dark gray), for foreground (white) and the test data (gray), (c) The 
correct classification with foreground (white) and background (black), (d) Classifi- 
cation made by I-M. (e) Classification made by GC-M. 



Two evaluation measures are used in this work: accuracy and Tanimoto 
coefficient. The accuracy is described in Figure [7] We define the positive class 
as foreground and the negative class as background. 

The Tanimoto coefficient (TC) is also known as Jaccard similarity measure. 
This measure, TC, is defined as: 



TC{k) 



Vmng(k) 
V m Ug{k) 
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Fig. 6. (a) The color image, (b) The labelling-lasso image with the training 
data for background (dark gray), for foreground (white) and the test data (gray). 

(c) The correct classification with foreground (white) and background (black). 

(d) Classification made by I-M. (e) Classification made by GC-M. 
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Fig. 7. (a) A confusion matrix for binary classification, where tp are true positive, 
fp false positive, fn false negative, and tn true negative counts, (b) Definition of 
accuracy used in this work. 



where V m n g {k) denotes the number of pixels classified as class k by both 
the model and the ground truth and V m \j g {k) denotes the number of pixels 
classified as class k by either the model or the ground truth. 



4.1 Numerical Results 

In Table [T] we summarize the measure values reached by the independent 
probabilistic model (I-M) and the copula-based model (GC-M). The infor- 
mation about the number of pixels well classified for each class is reported in 
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Table [31 We include in Table H] the performances of I-M and GC-M for each 
image. 

Table 1. Descriptive results for all evaluation measures. BG stands for the back- 
ground class and FG stands for the foreground class. 



Measure 


Minimum Median Mean Maximum Std. deviation 


I-M 


Tanimoto coefficient - 


-BG 


0.369 


0.690 


0.695 


0.955 


0.143 


Tanimoto coefficient - 


-FG 


0.341 


0.633 


0.638 


0.953 


0.168 


Accuracy 


0.571 


0.792 


0.795 


0.976 


0.107 


GC-M 


Tanimoto coefficient - 


-BG 


0.450 


0.816 


0.797 


0.976 


0.118 


Tanimoto coefficient - 


-FG 


0.375 


0.780 


0.758 


0.972 


0.141 


Accuracy 


0.587 


0.889 


0.871 


0.987 


0.083 



To properly compare the performance of the probabilistic models, we 
conducted a hypothesis test based on a Bootstrap method for the differences 
between the means of accuracy and Tanimoto coefficients, for both proba- 
bilistic models. Table [5] shows the confidence interval for the means, and the 
corresponding p- value. 



Table 2. Results for the difference between evaluation measure means in each 
model. A 95% confidence interval and a p-value are obtained through a Bootstrap 
technique. BG stands for the background class and FG stands for the foreground 
class. 



Measure 


95% Interval 


p-value 


Tanimoto coefficient - BG 


-1.52E-01 


-5.18E-02 


2.67E-04 


Tanimoto coefficient - FG 


-1.80E-01 


-5.96E-02 


3.67E-04 


Accuracy 


-1.14E-01 


-3.94E-02 


2.67E-04 



4.2 Discussion 

According to Table [TJ the GC-M shows the best behaviour for all evalua- 
tion measures. For instance, the mean accuracy for the I-M, 79.5%, is less 
than the mean accuracy for the GC-M, 87.1%. This means that using a I-M 
approximately has 8% more error rate than using a GC-M. 

The average of the Tanimoto coefficient for the background class is greater 
than the average of the Tanimoto coefficient for the foreground class, for 
both I-M and GC-M (see Table Q]) . These coefficients are shown for each 
image in Figure [8} (a) and Figure HJ-(b). Notice the Tanimoto coefficients for 
GC-M on the background and foreground are very similar, denoting a better 
classification than I-M. In most of the images and for each class, we can see 
in Figure [3-(c) and [51(d) (also in Table g} that GC-M outperforms I-M. In 
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average, according to Table [TJ the GC-M improves the I-M in both classes. 
For the foreground class from 63.8% to 75.8%, and for the background class 
from 69.5% to 79.7%. 



1 

1 3- 





(a) 



(b) 





(c) 



(d) 



Fig. 8. Tanimoto coefficient for the supervised pixel classification on the 50 images 
of [3] . Image order is the same as in Table [5] (a) Background (dashed line) and 
foreground (solid line) for the I-M. (b) Background (dashed line) and foreground 
(solid line) for the GC-M. (c) I-M (dashed line) and GC-M (solid line) for the 
background class, (d) I-M (dashed line) and GC-M (solid line) for the foreground 
class. 



Table Q] also shows information about the standard deviations for each 
evaluation measure. For all cases, the standard deviation indicates that using 
a GC-M in pixel classification is more consistent than using an I-M. 

In order to statistically compare the performance of the probabilistic mod- 
els, Tabled] shows confidence intervals and p- values that confirm differences 
between the models. None of confidence intervals include the value and all 
p- values are less than a — 0.05. 
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5 Conclusions 

In this work we introduce the use of Gaussian copulas in supervised pixel 
classification. According to numerical experiments the selection of a Gaus- 
sian copula for modeling structure dependence can help achieve better clas- 
sification results. An specific example is the image 227092, which appears in 
Figure [SJ its accuracy for the I-M classifier is 57.1%, whereas its accuracy for 
the GC-M classifier is 89.5%. For this image, the Gaussian copula improves 
its accuracy. 

Although we model the dependence structure for each image with the same 
copula function, this is not necessary. There are many copula functions and 
the Gaussian copula has been chosen due to its practical usefulness and easy 
implementation. However, having more than one copula at hand may im- 
prove the performance of the copula-based classifier. In such case, a copula 
selection procedure is necessary. The evaluation results are the consequence 
of the selected dependence structure and marginals. For instance, on the im- 
age 106024, Figure |31 the performance of the I-M classifier is 57.6% accurate 
(accuracy), whereas the GC-M classifier is 58.7% accurate. For most applica- 
tions better results can be obtained by selecting the best fitted copula func- 
tion from a set of available copulas. For example, in the experiment reported, 
the performance of the I-M classifier is better than GC-M for image fullmoon, 
Figure [H However, the GC-M is expected to improve the performance of the 
I-M classifier if we used the proper copula. 
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Appendix 



Table 3. Description of images used in this work. BG stands for the background 
class and FG stands for the foreground class. Columns 3 and 4 give the size of test 
data. The last 4 columns give the number of pixels well classified for each class 
and for each probabilistic classifier. 



Image name 


Image size 


Test pixels 


I-M 


GC-M 






BG 


FG 


BG 


FG 


BG 


FG 


21077 


321 x 481 


4322 


3353 


3648 


2551 


3144 


2763 


24077 


321 x 481 


11529 


10983 


6577 


8399 


6348 


10000 


37073 


321 x 481 


8260 


6642 


6404 


6187 


7115 


6014 


65019 


321 x 481 


9853 


8398 


9181 


3317 


9099 


4061 


69020 


321 x 481 


25203 


22634 


17561 


16813 


22798 


20421 


86016 


321 x 481 


3271 


2215 


2765 


2166 


2937 


2179 


106024 


321 x 481 


9093 


7368 


5528 


3961 


5574 


4087 


124080 


321 x 481 


18286 


18773 


16487 


16924 


16307 


18653 


153077 


321 x 481 


13851 


12098 


11072 


7774 


10806 


10638 


153093 


321 x 481 


12027 


11809 


7617 


8699 


11414 


9615 


181079 


481 x 321 


23845 


23110 


18650 


15320 


22494 


18705 


189080 


481 x 321 


23363 


23523 


20726 


21020 


19722 


20707 


208001 


481 x 321 


10227 


9530 


9994 


7669 


10064 


7914 


209070 


321 x 481 


6696 


4075 


5117 


2447 


5894 


2874 


227092 


481 x 321 


19656 


17321 


12869 


8229 


19129 


13966 


271008 


321 x 481 


10909 


9216 


8934 


7967 


8800 


8795 


304074 


481 x 321 


7239 


4794 


5017 


2591 


5534 


2810 


326038 


321 x 481 


10781 


7680 


8730 


4952 


9488 


5571 


376043 


481 x 321 


13654 


13485 


12022 


6094 


13072 


9343 


388016 


481 x 321 


17800 


15592 


15633 


11248 


17596 


12929 


bananal 


480 x 640 


29983 


24052 


17120 


23964 


20285 


23601 


banana2 


480 x 640 


27433 


21518 


17063 


20378 


25373 


18698 


banana3 


480 x 640 


26205 


20164 


25588 


12405 


26035 


14115 


book 


480 x 640 


26087 


21474 


15689 


20699 


19852 


21325 


bool 


450 x 520 


20123 


16850 


19500 


13279 


18726 


14373 


bush 


600 x 450 


32513 


22099 


21072 


12504 


27734 


14870 


ceramic 


480 x 640 


30549 


25709 


24809 


25069 


27328 


24791 


cross 


600 x 450 


34602 


25733 


32824 


25703 


32918 


25132 


doll 


549 x 462 


18866 


15106 


12976 


13269 


17947 


14960 


elefant 


480 x 640 


27858 


22787 


20918 


22656 


23158 


22540 


flower 


450 x 600 


16125 


13246 


14612 


12977 


15036 


13225 


Continued on next page 
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Table 3. {continued) 



Image name 


Image size 


Test pixels 


I-M 


GC-M 






BG 


FG 


BG 


FG 


BG 


FG 


fullmoon 


350 x 442 


1580 


1043 


1498 


1043 


983 


1026 


grave 


600 x 450 


12294 


12832 


11977 


11567 


12219 


10889 


llama 


371 x 513 


8930 


8445 


7783 


5322 


7547 


7287 


memorial 


600 x 450 


14853 


12598 


12900 


6902 


10936 


10964 


music 


480 x 640 


23945 


19494 


20457 


18723 


21794 


19112 


personl 


450 x 600 


19092 


16384 


16452 


10041 


18831 


15372 


person2 


450 x 600 


12796 


9595 


11492 


5358 


12465 


9219 


person3 


600 x 450 


14649 


11450 


13494 


8122 


14022 


10112 


person4 


450 x 600 


19250 


16631 


15230 


10691 


18197 


11653 


person5 


600 x 450 


13990 


11332 


13009 


8327 


13025 


10377 


person6 


600 x 450 


19015 


15645 


16753 


9038 


16071 


11732 


person7 


600 x 450 


12110 


9634 


9934 


6998 


11795 


8093 


person8 


480 x 640 


16684 


12741 


6740 


11157 


14690 


9534 


scissors 


480 x 640 


30768 


23335 


28152 


19910 


30181 


19960 


sheep 


600 x 450 


5331 


3733 


4750 


3098 


5243 


3415 


stonel 


480 x 640 


18716 


15635 


16087 


15525 


18376 


15528 


stone2 


480 x 640 


22002 


18489 


21556 


16315 


21788 


17692 


teddy 


398 x 284 


13892 


13739 


13790 


13191 


13426 


13466 


tennis 


472 x 500 


19471 


13129 


18054 


8673 


18322 


8613 



Table 4. Evaluation measures for each image. TC stands for the Tanimoto coeffi- 
cient, BG stands for the background class and FG stands for the foreground class. 
Columns 2, 3 and 4 give the results for the independent probabilistic model. The 
last 3 columns give the results for the Gaussian copula-based probabilistic model. 



Image name 


I-M 


GC-M 




TC-BG 


TC-FG 


Accuracy 


TC-BG 


TC-FG 


Accuracy 


21077 


0.712 


0.633 


0.808 


0.640 


0.610 


0.770 


24077 


0.466 


0.527 


0.665 


0.507 


0.619 


0.726 


37073 


0.735 


0.728 


0.845 


0.801 


0.772 


0.881 


65019 


0.615 


0.366 


0.685 


0.641 


0.444 


0.721 


69020 


0.566 


0.555 


0.719 


0.832 


0.816 


0.903 


86016 


0.833 


0.796 


0.899 


0.888 


0.855 


0.933 


106024 


0.442 


0.362 


0.576 


0.450 


0.375 


0.587 


124080 


0.819 


0.823 


0.902 


0.886 


0.899 


0.943 


153077 


0.609 


0.523 


0.726 


0.706 


0.703 


0.826 


153093 


0.503 


0.536 


0.685 


0.803 


0.774 


0.882 


181079 


0.590 


0.541 


0.723 


0.796 


0.765 


0.877 


189080 


0.801 


0.804 


0.890 


0.753 


0.762 


0.862 


208001 


0.827 


0.786 


0.894 


0.850 


0.816 


0.910 


209070 


0.615 


0.433 


0.702 


0.746 


0.589 


0.814 


227092 


0.448 


0.341 


0.571 


0.831 


0.782 


0.895 










Contir 


uied on 


next page 



372 



R. Salinas-Gutierrez et al. 



Table 4. (continued) 



Image name 


I-M 


GC-M 




TC-BG 


TC-FG 


Accuracy 


TC-BG 


TC-FG 


Accuracy 


271008 


0.735 


0.712 


0.840 


0.777 


0.777 


0.874 


304074 


0.531 


0.369 


0.632 


0.600 


0.432 


0.693 


326038 


0.646 


0.509 


0.741 


0.736 


0.621 


0.816 


376043 


0.571 


0.403 


0.668 


0.735 


0.664 


0.826 


388016 


0.706 


0.633 


0.805 


0.860 


0.818 


0.914 


bananal 


0.569 


0.649 


0.760 


0.667 


0.699 


0.812 


banana2 


0.597 


0.639 


0.765 


0.839 


0.793 


0.900 


banana3 


0.753 


0.597 


0.819 


0.807 


0.694 


0.866 


book 


0.584 


0.649 


0.765 


0.757 


0.770 


0.866 


bool 


0.823 


0.760 


0.887 


0.829 


0.788 


0.895 


bush 


0.500 


0.373 


0.615 


0.698 


0.553 


0.780 


ceramic 


0.795 


0.797 


0.887 


0.868 


0.857 


0.926 


cross 


0.948 


0.934 


0.970 


0.935 


0.917 


0.962 


doll 


0.627 


0.632 


0.773 


0.944 


0.934 


0.969 


elefant 


0.747 


0.762 


0.860 


0.824 


0.820 


0.902 


flower 


0.891 


0.879 


0.939 


0.931 


0.923 


0.962 


fullmoon 


0.948 


0.927 


0.969 


0.616 


0.626 


0.766 


grave 


0.883 


0.880 


0.937 


0.858 


0.844 


0.920 


llama 


0.646 


0.555 


0.754 


0.748 


0.741 


0.854 


memorial 


0.628 


0.474 


0.721 


0.663 


0.664 


0.798 


music 


0.828 


0.815 


0.902 


0.896 


0.883 


0.942 


personl 


0.647 


0.528 


0.747 


0.937 


0.924 


0.964 


person2 


0.675 


0.492 


0.753 


0.946 


0.929 


0.968 


person3 


0.751 


0.644 


0.828 


0.877 


0.837 


0.925 


person4 


0.605 


0.518 


0.722 


0.751 


0.659 


0.832 


person5 


0.765 


0.676 


0.843 


0.872 


0.844 


0.924 


person6 


0.654 


0.505 


0.744 


0.701 


0.631 


0.802 


person7 


0.674 


0.593 


0.779 


0.864 


0.813 


0.915 


person8 


0.369 


0.492 


0.608 


0.739 


0.647 


0.823 


scissors 


0.823 


0.767 


0.888 


0.884 


0.834 


0.927 


sheep 


0.796 


0.718 


0.866 


0.928 


0.894 


0.955 


stonel 


0.855 


0.850 


0.920 


0.976 


0.972 


0.987 


stone2 


0.892 


0.862 


0.935 


0.956 


0.946 


0.975 


teddy 


0.955 


0.953 


0.976 


0.948 


0.948 


0.973 


tennis 


0.755 


0.596 


0.820 


0.764 


0.603 


0.826 
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Abstract. Understanding the protein-to-protein interactions at the subcel- 
lular level, as well as other organic molecules, is crucial to explain cellular 
functions and to elucidate disease mechanisms. These interactions can be 
captured visually by overlapping the fluorescent microscopic images of two 
proteins tagged with fluorescent labeling agents that react to green and red 
wavelengths respectively. Interaction is determined by subjectively assessing 
the amount colocalization of green and red on the image composite based on 
the amount of yellow present in the image composite (i.e., green and red form 
yellow). Attempts to reduce the subjectivity of this process have focused on 
the computation of statistical coefficients and related methods. Even though 
statistical colocalization coefficients give a degree of correlation among the 
imaged proteins, they still need to be interpreted with subjective qualifiers 
like "high", "low", "strong", etc. Hence, there is no current agreement on the 
meaning of these coefficients among researchers. In this paper we propose 
the use of fuzzy linguistic variables to model the subjective interpretation 
of co-localization coefficients. Based on interpretations found in the litera- 
ture, we produce a set of rules that map the coefficient values to a linguistic 
interpretation. The result of this work is a tool that provides an descrip- 
tive ensemble of coefficient interpretations that could guide researchers to a 
uniform interpretation colocalization criteria. 

1 Introduction 

Fluorescence microscopy makes possible the functional analysis of proteins 
and other other sub-cellular structures [H [H [3]. This imaging technique 
consists of tagging proteins of interest using fluorescent labeling agents 
which react to a specific wavelength. The spatial distribution of a protein 
over a cell is captured through an image sensor tuned to the fluorochrome 
wavelength. 
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Fig. 1. An example image from the Yeast database. In (a) is shown the green 
channel. In (b) is shown the red channel, and in (c) the two joint channels. In (d) 
we have the scatter diagram showing the pixels distribution across the two channels. 



Among the many uses of fluorescent microscopy images, it is possible to 
assess the interaction among two or more proteins at the sub-cellular level. 
Understanding these interactions is crucial to explain cellular functions and 
to elucidate disease mechanisms. The simultaneous presence of two proteins 
in the same sub-cellular compartment indicates a functional relationship for 
some specific biological process. Imaging the same specimen with two pro- 
teins tagged to express at different wavelengths (e.g., green and red), allows 
us to register the spatial distribution of the proteins simultaneously. By over- 
laying both images on the same plane, yellow regions would appear on those 
regions where red and green pixels spatially overlap. This overlaying method 
and the process of assessing the relative overlap (i.e., yellow regions) among 
the two proteins is known as colocalization. 

An example using images from the yeast database [U E] is shown in 
Figure HJ including the highlight of areas with strong visual colocalization. 
Biologists typically do a subjective assessment of colocalization through vi- 
sual inspection of image overlays. Hence, colocalization results can vary from 
one person to another, or from one study to another. Attempts to reduce 
the subjectivity of this process have focused on the computation of statis- 
tical coefficients El [TO] and related methods [HI El US]. Even though 
statistical colocalization coefficients give a degree of correlation among the 
imaged proteins, they still need to be interpreted with subjective qualifiers 
like "high", "low", "strong", etc. Hence, there is no current agreement on the 
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meaning of these coefficients among researchers, fn addition, they seem to be 
strongly influenced by the acquisition and post-processing steps applied to 
the images. 

In this paper we propose the use of fuzzy linguistic variables p3J [13 [HE 
[T7I ITS] to model the subjective interpretation of co-localization coefficients. 
Based on interpretations found in the literature, we produce a set of rules 
that map the coefficient values to a linguistic interpretation. The result of this 
work is a tool that provides an descriptive ensemble of coefficient interpreta- 
tions that could guide researchers to a uniform interpretation colocalization 
criteria. The proposed scheme can be described in two steps. First, using 
fluorescence microscopy images, a quantitative description of colocalization 
is obtained through the use of statistical coefficients reported in the liter- 
ature El HU H2 E2 Q3H H3] as features to fuzzy models. Second, based 
on the qualitative interpretation of different colocalization case studies, we 
constructed fuzzy linguistic variables to represent the quantitative results. 
This was followed by membership functions modeling. We tested our fuzzy 
model over the yeast and A431 cells image databases. The proposed model 
outputs natural language fuzzy predicates providing a consistent description 
of quantitative colocalization analysis results. 

The paper is organized as follows. In Section 2 we give a brief introduction 
to the fluorescence microscopy imaging process. The quantitative colocaliza- 
tion feature extraction process is discussed in Section 3. In Section 4 we dis- 
cuss the design our fuzzy logic-based subjective colocalization analysis model. 
Examples of the model utilization are presented in Section 5. We discuss our 
conclusions and subsequent work in Section 6. 

2 An Overview of Fluorescence Microscopy 

Fluorescence microscopy is an imaging technique where proteins (or some 
other molecule) are tagged using fluorescent labeling agents (fluorophores) 
which react to a specific wavelength. When different proteins on the same 
specimen are tagged, a multichannel image is generated, where each channel 
captures the contribution of each fiuorophore. The image formation process 
can be approximated as a linear spatially-invariant system where the intensity 
distribution is modeled as QUI 



I(ni,n 2 ,n 3 ) oc 

'SS 3 



. m n 2 n 3 

h\\ —^ — u, -^ — v, — — w 
? ? <? 



x{u,v 1 w)dudvdw, (1) 



where A = -gp- is the emitted light wavelength, E em is the energy level 
difference during light emission, i is Plank's constant, £ is the speed of light, 
f is the magnification of the microscope objective, \ is an object dependent 
function related to the light emission properties of the fiuorophore. 

The function \h\\ is known as the point spread function (PSF) of the 
microscope. It can be measured experimentally or modeled using the physical 
properties of the system. A common mathematical model is given by |19j 
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h x (n 1 ,n 2 ,n 3 )= [ P(u,v)e j2 ™ 3! ^£e- j2n21 ^ L dudv, (2) 

which represents the inverse Fourier transform of the circular aperture of 
the objective P(u,v). The parameter a is the focal length of the objective 
and is inversely related to the numerical aperture (NA) of the microscope. A 
detailed discussion of this model can be found in [H] . 

As with other imaging techniques, images are acquired under non-ideal 
conditions that generate undesirable distortions or aberrations. In microscopy 
imaging, detection of light emissions is severely limited by the properties of 
photo sensors. Light detection is effectively down to counting the number of 
photos incident upon a photodctcctor. At such level, the photon emission is 
described by a Poisson random variable which generates noise contaminated 
images [El EH [22J |23j . Background shading is also a common distortion and 
can be caused by several factors: non uniform illumination, inhomogeneus 
detector sensitivity, dirt particles in the optics, nonspecific sample staining, 
and even auto-fluorescence. The background shading process can be modeled 
as 

b(ni,n 2 ) = I(ni,n 2 )a(ni,n 2 ), (3) 

where b(ni,n 2 ) represents the product of the illumination I(ni,n 2 ) and the 
original sample a{n\,n 2 ) [HI [23]. Now, if we model the detector's gain and 
offset we have 

c(ni,n 2 ) = g(ni,n 2 )b(ni,n 2 ) + o(ni,n 2 ), (4) 

where g(ni, n 2 ) is some gain and o(ni, n 2 ) some offset. Using §3§ to substitute 
in (H]) we have 

c(m,n 2 ) = g(n 1 ,n 2 )I(n 1 ,n 2 )a(ni,n 2 ) + o(n 1 ,n 2 ). (5) 

These defects need to be corrected before analysis is performed. Digi- 
tal enhancement and restoration techniques have been extensively studied 
[12 [9l [lOl E3 [26] . In [27] we introduced a restoration method that recovers 
a{n\,n 2 ) given the noisy observation c{n\, n 2 ) described in ©. The proposed 
method is simple and performed better than traditional ones. Essentially the 
method consists of a local median filter (e.g. 3x3 followed by background 
subtraction. For an image consisting of red (R) and green (G) channels, a 
restored image a(ni, n 2 ) is generated by 

Z{n u n 2 )^\^^Y { P l,n2 \ G V d{ ^ n2 \ G . (6) 

[Q[3xS\{ c {ni,n 2 ) R }-o{ni,n 2 )R 

where g[3 X 3] {•} is a 3 x 3 median filter is applied to each channel and 6(ni, n 2 ) 
is the offset estimate obtained as the average of a set of representative images 
from the database. 

To assess restoration filter performance is computed using different metrics 
over synthetic and real life images. Consider the synthetic images shown 
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Fig. 2. In (a) is shown the "circles and lines" image, and its degraded version in (b). 
The "cells" image is shown in (c), and its degraded version in (d). In (e) is presented 
the original amplified section of circles and lines, and its restored image in (f). The 
original amplified cells in (g), and its restored image in (h). The results show very 
good performance over Poisson noise and offset, visually and quantitatively. 



in Figure [5] (a) and (c). Synthetic images were degraded with a constant 
offset 0(711,712) = 77 added to each pixel. Then, images were contaminated 
with random Poisson noise. Figure [5] (a) is a typical case of study in image 
processing jHHB] used to address the ability of a filter to recover edges, lines, 
and shape of the original image given a noisy observation as in (b). Likewise, 
Figure [2] (c) is representative of a microscopy imaging scenario, and the goal 
is to test the filter ability to recover texture, edges, and illumination given 
a degraded image as in (d). In the test of Figure [2] (b) we used 77 = 89 and 
in Figure [5] (d) r\ = 11. Clearly, the results over synthetic images shown in 
Figure [5] (f ) and (h) are very good when compared to the true images in 
Figured] (e) and (g). Therefore, we proceeded to test restoration results over 
real life data. 

For the case of real biological images, we evaluated the restoration filter 
with two databases. First, we used the yeast database 16J which contains 547 
two channel image samples. It is a very good alternative to test the restoration 
methods for colocalization analysis. In Figure [3] (a) is shown an example of 
the content of the yeast database as well as its restored version using M33GM 
in Figure [5] (a) . 

The second database was reported in [5S] and consist of visible fluo- 
rescent fusion-proteins expressed in A431 cells and a fluorescent label. The 
crbB family of receptor tyrosine kinases includes the epidermal growth factor 
receptor (erbBl) and erbB3. These membrane proteins regulate cell growth 
and differentiation through binding of ligands to their extracellular domain, 
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Fig. 3. In (a) is shown the noisy raw image from the Yeast database and in (b) its 
restored image 



ni 


r jfl, 3p2Sjj 


Rj 


JBh 
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69 



Fig. 4. Visible fluorescent fusion-proteins expressed in A431 cells. In (a) the noisy 
image, and in (b) the restored image 



which activates the protein and initiates signalling. We identify this group of 
images as the A431 cells database. An example image is shown in Figure|4](a), 
and the results of restoration are shown in Figure 2] (b). The filter effectively 
reduces Poisson noise, preserves the texture, and subtracts the offset. 

3 Quantitative Colocalization Feature Extraction 



The quantitative analysis of colocalization consists of computing the 2D sig- 
nal spatial overlap across multiple channels. This high precision analysis 
allows researchers to understand the mechanisms of protein-to-protein 



Subjective Colocalization Analysis with Fuzzy Predicates 379 



a) 


W 




* i 


4 





Fig. 5. An example of colocalization. In (a) is shown the green channel G(ni , 112); in 
(b) is shown the red channel R(n\, 712); in (c) are shown the superimposed channels; 
and in (d) are shown in white, the regions of more colocalization. 



interactions. [H [5J [TT] [T2] [T3] E3 [35]. As an example, consider the cellu- 
lar image acquired by the fluorescence microscopy method; its green channel 
G(ni,ri2) is shown in Figure [5] (a), and its red channel R(n\,ri2) is shown 
in Figure [5] (b) . We can study the cellular protein-to-protein interaction by 
observing and quantifying the superposition of Figure [5] (a) and Figure [5] (b) 
as in Figure [5] (c). The regions of more colocalization are highlighted in white 
on Figured] (d). Ultimately, visual assessment of colocalization is a subjec- 
tive process highly dependent on the experience of the analyst. Attempts to 
reduce the subjectivity of this process have focused on the computation of 
statistical coefficients or parameters [7] [5J [TU] [TT] [25] and algorithms that 
exploit them [TJ [TH] HOI US HU • There are specific parameters (i.e., features) 
used to estimate the degree of colocalization between two channels. We have 
identified 14 colocalization features in literature. In this paper, we will de- 
scribe and use the five most popular features: Pearson's correlation coefficient, 
Overlap coefficient, Fraction of colocalizing regions, Mander's colocalization 
coefficient, and Intensity correlation coefficient. 

Table [JJ summarizes the properties of the coefficients that will be explained 
in detail next. 



3.1 Pearson's Correlation Coefficient rp 

Also known as Correlation Coefficient, the Pearson's Correlation Coefficient 
is widely used and accepted, specially in regression applications. It provides 
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Table 1. Summary of features (coefficients) for quantitative colocalization 



Name 


Variables 


Range 


Ref. 


Pearson's Corr. C. 


r P 


[-1,1] 


9, 10, 25 


Overlap C. (Mult. Meth.) 


r 


[0,1] 


7, 12, 13 


Frac. Coloc. Reg. (O. C.) 


fci,fe 


vary 


7, 12, 13 


Mander's Coloc. C. - Gen. 


Afi, M a 


[0,1] 


7 12. 13 


Intensity Corr. Quot. 


ICQ 


[-0.5,0.5] 


m 



information about the relationship between the region of intensities and their 
distribution [3 [HI M OH ED OH EE3] ■ The Pearson's Correlation Coefficient r P 
is defined as 



T P 



E {R{ni,n 2 ) - E[R]) (G(m, n 2 ) - E[G\) 
1/J2 (R( ni ,n 2 ) - E[R]f £ (G(n u n 2 ) - E[G]f 



(7) 



Its domain is between [—1,1]. It will be —1 if the data has negative linear 
relationship, a particular meaning is hard to explain. It will be (or close to 
zero) if the data has perfect exclusion or no linear relationship and the pixels 
are distributed along the entire scatter diagram. If the value is 1 it means 
perfect correlation; the data has a linear relationship. For the case shown in 
Figure [5] the value is rp = 0.7267. 

3.2 Overlap Coefficient r (The Multiply Method) 

This coefficient dictates the overlap between two channels; it shows a degree of 
colocalization |12j . In contrast with Pearson's, this coefficient does not return 
negative values, does not average any pixel intensity, and it is not sensitive to 
intensity variations [TTJ [T3] . However, the authors of [7] recommend its usage 

under the following condition: "i-" 2 p , — : — r 

formally defined as 



1 . The Overlap coefficient is 



J2 G(n 1 ,n 2 )R(n 1 ,n 2 ) 

ni,n 2 

£ G{ ni ,n 2 f Y. R(ni,n 2 )* : 



(8) 



where its domain is between [0, 1]. A value of means that no pixels overlap; 
while 1 means that all the pixels overlap. The case when r = 0.5 implies that 
50% of the pixels overlap with each other. For the case shown in Figure [5] 
r = 0.8316. It is clearly high because of the non-zero background content. 



3.3 Fraction of Colocalizing Regions fei, k 2 (Overlap Coefficients) 

These coefficients represent the differences between each channel intensities 
[TT| IT2"1 [13] . This two coefficients overcome the problems generated from a re- 
striction in the overlap coefficient in (|8|). However, fei, k 2 are very sensitive to 
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the absolute fluorescent intensity. If one channel has been treated differently 
from the other, such that the total intensity vary (bleaching for instance), 
this will affect the coefficients [7j. We can denote the coefficients as follows 

J2 G(ni,n2)R(ni,n 2 ) 

fc i = ^ nl ^ . (9) 

iil,n 2 

J2 R(ni,n 2 )G(ni,n 2 ) 
k 2 = „ , r^ ■ (10 

2^ R{ni,n 2 y 

m,na 

The individual range of values may vary. Results can be interpreted as some 
indicator of each antigen's contribution to colocalization areas. 

3.4 Mander's Colocalization Coefficients Mi, M 2 

The coefficients Mi, describe ith channel's contribution to colocalization 
using its intensity values. These coefficients are proportional to ith chan- 
nel's fluorescence amount, relative to the ith channel total fluorescence. Such 
relationship is described in [TTJ [12 H3] as 

£ (G(ni,n 2 )|i?(ni,n 2 )>0) 

Mr = ^ =-^ , , (11) 

2^ G(ni,n 2 ) 

"1,«2 

E {R{n u n 2 )\G{ ni ,n 2 ) > 0) 

M 2 = ^ , (12) 

2^ R{ni,n 2 ) 

n\ ,ri2 

where Ml, M 2 are within the range [0, 1]. The meaning can be explained with 
the following example: if Mi = 1.0 and M 2 = 0.3, it means that green channel 
pixels colocalize with red's, but only 30% of pixels in red channel colocalize 
with green's. 

3.5 Intensity Correlation Quotient ICQ 

Two images vary around their respective mean if their intensities vary in syn- 
chronous [7] . Therefore the product of the differences from the mean (PDM) 
will be positive for such images, PDM > 0. However, if the intensities vary 
asynchronously, the PDM will be negative, PDM < 0. The PDM analyzes 
the relationship between intensities, and is denoted as 

PDM(n u n 2 ) = (G(m, n 2 ) - E[G\) (R(m, n 2 ) - E[R]) , (13) 

where E[G] and E[R] denote the expected value for each channel respectively. 



382 P. Rivas-Perea, J. Gerardo Rosiles, and W. Qian 

The ICQ value is based on the sign of the PDM [7J. The ICQ is defined as 
the PDM positive occurrences number £, divided by the negative occurrences 
£. Thus, the ICQ is the quotient denoted as 

ICQ=(±\- 0.5. (14) 

The range of ICQ falls between [—0.5,0.5]. The coefficient value can be 
interpreted as follows: if the ICQ w 0, means random decoloration; if —0.5 < 
ICQ < 0, means segregated decoloration; and if < ICQ < 0.5 means a 
dependent decoloration. The latter case is an indicator of good colocalization. 

4 Colocalization Subjective Quantification with Fuzzy 
Logic Theory 

The colocalization study performer often would like to obtain a linguistic 
result from a coefficient rather than just a rough number. Consider Pear- 
son's correlation coefficient rp = 0.865 after some colocalization experiment. 
Then, the observer of the study would probably say, "Pearson's correlation 
coefficient is high." However, this is a subjective assessment that may not 
correspond to another observer whose definition of high may be a coefficient 
valued at 0.99. Hence, there is paucity of reference levels to map coefficient 
values to a human like quantification. The process of giving such subjective 
quantification is called fuzzification 18 . Prior knowledge regarding the vari- 
ables to fuzzify is desired, but not mandatory. Information about the variable 
range and distribution is useful. In this section we address the problem of the 
subjective quantification by using fuzzy logic theory. 

4.1 Linguistic Variables in Fuzzy Logic 

A linguistic variable is a word represented by a fuzzy set [THj. For example, 
a linguistic variable can take an attribute like "negative, positive, zero, high, 
low, more or less" etc. Each of this words has its own membership function. 
Following the notation in [TB], let us define a linguistic variable by the 
following quintuple 

(x,T(x),U,G,M) (15) 

x : name of variable 
T (x) : set of possible linguistic terms for x 
U : set of universe of discourse 
G : syntactic grammar that produces terms in T{x) 
M : semantic rules which map terms in T(x) to 
fuzzy sets in U . 
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Given the previous definitions, we introduce the translation of coefficients 
in Section [3] to linguistic variables. To distinguish between a crisp and a 
linguistic (fuzzy) variable we use the symbol ". For instance, the crisp Pear- 
son's correlation coefficient is denoted as rp, while the linguistic Pearson's 
correlation coefficient is defined as rp. 

4.2 Fuzzy Pearson's Correlation Coefficient 

From [T2] we know that rp's universe of discourse is [—1,1]. Now, we must 
find the possible words and their membership functions. In [12j it is found 
that —1 corresponds to "negative" correlation; so, let (J-n% g (u) denote this 
possible term for rp. From [7] and [H] we also can find n\[ t {u) as a "little" 
correlation, with an average value close to 0; as well as "reasonable" /i^ a (u), 
with an average value close to 0.346. Finally, we found "strong" correlation 
l^tr( u )i centered at 0.920. In |5S] can be found another two possible terms. 
The first is "high" p^„(u), in the range [0.685,0.877], with an average value 
of 0.761. The second term is "low" t^ w {u), in the range [0.434,0.615] with 
average 0.516. 

At universe of discourse upper and lower limits, "s membership functions" 
(SMF) and "z membership functions" (ZMF) are commonly used since they 
consider the inclusion of such limits. In comparison, a Gaussian membership 
function does not fully cover the limits of the universe of discourse. Formally 
a "z membership function" is defined as 



fi(u) = < 



1, u < a 



(16) 
0, u> b 



where parameters a and b define the start and end of the function respectively. 
Similarly, an "s membership function" is formally defined as 



fi(u) 



0, u < a 

2(rf 2 , a<u<z±> 



2 
b-a ) ' ~~ 2~ 



1-2(6-^-) ,^<u<b 



(17) 



1, u> b 



where the parameters a and b define the start and end of the function respec- 
tively. A Gaussian-shaped membership function (GMF) falls in the family of 
the exponential and radial basis functions; and it is defined as 

n(u) = e =i ^ ) (18) 
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where c is the centering parameter, and a is the spread parameter. A similar 
function is the 7r-shaped membership function (PIMF) defined as 



H{u) 



1 



I 26' 



(19) 



with parameters a and 6 defining the start and end of the function respec- 
tively; and c as the centering parameter. Now, following the definition in 
(fT5|) . we can define fp as follows 



r P 

x 

T(x) 

U 

G(x) 



M 



(20) 



Pearson's Correlation 

{negative, little, reasonable, strong, high, low} 

T i+l = {strong} V {very T 1 } 
(u)) ,(u,i4f t (u)j , 

(«)) . \U,H r s f r (u)J , 



\ u i t J 'neg\ 
[U, H rea \ 



(«>**!£«,(«)) 



u€U 



where the operator V denotes the fuzzy operator max; /i^e ff (w) uses a function 
of the type defined in (Tpo|) with parameters a = — 1, and 6 = 0; /^(u) uses 
a function of the type defined in (fTH|) with parameters c = 0, and a = 0.2; 
Hrea( u ) uses a function of the type defined in (fig]) with parameters c = 0.346, 
a = 0.2; fi^ r (u) uses a function of the type defined in (|T5|) with parameters 
c = 0.92, a = 0.2; fJ^„(u) uses a function of the type defined in ([15]) with 

parameters c = 0.761, a = 0.2; ^\g W {u) uses a function of the type defined in 
(JTHJ) with parameters c = 0.516, cr = 0.2; and finally Hpo 8 {u) uses a function 
of the type defined in (JTTj) with parameters a = 0.434, and 6 = 1. 

4.3 Fuzzy Overlap Coefficient 

In [71 H2] we can find that the universe of discourse for the linguistic variable 
r is [0,1]. From [7J we obtain n r ran (u) as a "random colocalization" overlap, 
centered near to 0.5; as well as "high colocalization" /j,^ (u), when it is 1. 
Finally, we found "low colocalization" /Jq ow (u), when the value reaches 0. So, 
following the notation in (TT5|) . we can define r as 
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r = 
x : 



(21) 



T{x) 

U 
G(x) 

M 



Overlap Coefficient 
random colocalization, 
high colocalization, 
low colocalization 

[0,1] 

T i+l = {low} V {very T 1 } 

(«> lAigW) » 



where if ran {u) uses (fTU|) for a = 0.25, 6 = 3, and c = 0.5; H r hig {u) uses (fTT|) 
for a = 0.5, and 6 = 1; and finally, fJ>J ow (u) uses (JT6J) for a = 0, and 6 = 0.5. 

4.4 Fuzzy Fraction of Colocalizing Regions (Overlap Coefficients) 

For this coefficient, the most used is the k\ coefficient since it shows the ratio 
between channel red versus green [7]. As specified previously this variable 
has a variable universe of discourse; however, the most common values lie 
close to one. From this finding we can define the linguistic variable k\ over 
the range [—00,00]. The variable k\ has one membership function centered 

at one and is denoted as fJ- V ai{u) which is a "valid" ratio between channels. 
Following the notation in (fT5|) . we can define k\ as 



fci = 



(22) 



Ch.Red/Ch. Green ratio 
{valid, not valid} 

[—00, 00] 

T(x) = 1 - T(x) 

(«./*L(«))> 



X 

T(x) 

U 

G(x) 

M 



where M„a/( n ) uses (O) for c = 1, and a = |; while \^ m (u) is just 1— fi v li(u). 

4.5 Fuzzy Mander's Colocalization Coefficients - General 

The colocalization coefficients Mi,M% from Mander's are defined over the 
universe of discourse [0, 1] as explained previously, thus the linguistic vari- 
ables Mi,M2 will be defined in this universe as well, fn [7j is described 
the word "significant" n si Uu) for a value of 0.746. Also we can find the 



Mi 



word "not much" Aw m (w) for 0.155, as well as "high" [i^Ju) for the range 
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of [0.879,0.993] (with mean value 0.936). Similarly, we can find the word 
"strong" ^ (u) defined over the range [0.970, 0.998] (with mean value 0.984). 
We also find Hnoc( u ) which is "not coincident" correlation for values less than 
0.008. Finally the word "hard" ^ r {u) is found for a value of 0.590. Following 
the notation in ([151 . we define M\ as 



Mi = 
x 

T{x) 

U 
G{x) 



(23) 



Colocalization Coefficient for Red Channel 
not coincident, not much, hard, 
significant, high, strong 

[0,1] 

T t+1 = {strong} V {very T 1 } 



M 



(u, VnocW) > 

(u,m£L(u)), (u,m£.(«)). 



Mi, 



1V1\ / \ 



M€f/ 



where \x^ c {u) uses ([16} for a = 0.008 and fo = 0.155; ^ m {u) uses (H^) for 
c = 0.155 and a = 0.2; fi^(«) uses flU for c = 0.590 and a = 0.2; M^(«) 
uses (03) for c = 0.746 and a = 0.2; /i^(«) uses (JTHJ) for c = 0.936 and 

a = 0.2; finally (i^(u) uses (HH) for a = 0.746 and b = 0.984. The linguistic 
variable M2 is defined in the same way as Mi, but the only difference is the 
value for x which is "Colocalization Coefficient for Green Channel." 



4.6 Fuzzy Intensity Correlation Quotient Coefficient 

Since the ICQ coefficient pj [30] is defined over the universe of discourse 
[—0.5,0.5], the linguistic variable ICQ will be defined in this universe as 
well. In [7] is described the word "segregated" n 1 ^® (u) for a values in the 
range [—0.5,0]. It is found as well the word "random" jU^on ( u ) f°r values 
near to zero. Finally the word "dependent" H de ^{ u ) is found for values on 
the range [0,0.5]. Following the notation in (JTHJ), we define ICQ as 
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ICQ = 

x 
T(x) 

U 
G{x) 

M 



(24) 



Intensity Correlation Quotient 

{segregated, random, dependent} 

[-0.5,0.5] 

T l+1 = {segregated} V {very T 1 } 

'(tO) > (t 



u,vlg Q < 



.«>/*'£?(") 



ICQ, 



UGU 



where nieg Q ( u ) uses (HSJ f° r a = ~°- 5 an d b = 0.5; /4SP( W ) uses (HHJ) for 
c = and er = |; and finally H de p{u) uses (fT7|) for a = —0.5 and b = 0.5. 

4.7 Fuzzy Predicates from Linguistic Variables 

Fuzzy predicates are a semantic representation of a sentence. For instance, 
"It is cold outside." Here we identify "cold" as the linguistic variable. In 
colocalization analysis, we defined fuzzy predicates. As an example, consider 
the fuzzy variables and the fuzzy predicates shown in Table [21 The syntax is 
flexible so that the colocalization study performer can pick the combination 
that best represent the human-like reasoning or help the best in human-like 
decision making. Human-like knowledge representation can be achieved using 
ordinary fuzzy predicates. Such fuzzy predicates depend on linguistic vari- 
ables. For the particular case of quantitative colocalization analysis trough 
florescence microscopy, fuzzy membership functions were created according 
to the cases found in literature. In Figure[5]we present a summary of the mod- 
eled membership functions. The relevance of this research can be summarized 
in two parts: first, we addressed the inaccurate subjective human interpre- 
tation of colocalization, providing a formal review of quantitative colocal- 
ization coefficients; second, using fuzzy logic, we designed linguistic variables 



Table 2. Definition of linguistic variables and fuzzy predicate examples 



Fuzzy Predicate Definition 


Fuzzy Predicate Example 


u r P [x] is fp[T{x)f 


Pearson's Correlation is strong 


"r[x] is showing r[T(x)]" 


"Overlap Coefficient is showing high 




colocalization" 


"ki[x\ is ki[T(x)]" 


"Ch.Red/Ch. Green ratio is not valid" 


u Mi[x] shows Mi[T(x)] 


"Colocalization Coefficient for Red 


colocalization" 


Channel is showing significant 




colocalization" 


li fCQ[x] shows ICQ[T{x)} 


"Intensity Correlation Quotient 


staining" 


Coefficient shows random staining" 
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Pear s on's Corre la tion 
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Fig. 6. Graphical representation of the linguistic variables, and its associated 
membership functions modeled from a literature review 



based on literature review and provided an accurate human-like colocalization 
quantification. 



4.8 Experimental Results and Discussion 

We performed several experiments in different databases such as Yeast and 
A431 cells. The experiments consist of computing the previously described 
coefficients and using them as inputs to a fuzzy system. The fuzzy system 
consists of five linguistic variables that take its associated coefficient numer- 
ical value and produce a set of human like colocalization quantification. The 
linguistic variables x are constructed in the form of (|15|) . The output of each 
linguistic variable is computed by taking the maximum valued membership 
function associated to that linguistic variable. Then, we use the linguistic 
term T(x) associated to the maximum valued membership function to con- 
struct a human-like sentence (fuzzy predicate). This procedure can be for- 
malized in four steps: 

Step 1. Image restoration. Channels R{ni,ri2) and G(ni,ri2) are restored 
using the filter discussed in Section 2 to obtain c(n±, 712). 
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Table 3. Fuzzy predicates obtained from samples in Figures [T] [3] HI and|S] 



Figure 



Figure Q] 



Figure [3] 



Figure |4] 



Figure 



Fuzzy Predicates. 



Pearson's Correlation is strong. 

Overlap Coefficient is showing high colocalization. 

Ch.Red/Ch. Green ratio is not valid. 

Colocalization Coefficient for Red Channel shows 

strong colocalization. 

Colocalization Coefficient for Green Channel shows 

strong colocalization. 

Intensity Correlation Quotient Coefficient 

shows segregated staining. 



Pearson's Correlation is strong. 

Overlap Coefficient is showing high colocalization. 

Ch.Red/Ch. Green ratio is not valid. 

Colocalization Coefficient for Red Channel shows 

strong colocalization. 

Colocalization Coefficient for Green Channel shows 

strong colocalization. 

Intensity Correlation Quotient Coefficient shows 

segregated staining. 



Pearson's Correlation is strong. 

Overlap Coefficient is showing high colocalization. 

Ch.Red/Ch. Green ratio is not valid. 

Colocalization Coefficient for Red Channel shows 

strong colocalization. 

Colocalization Coefficient for Green Channel shows 

strong colocalization. 

Intensity Correlation Quotient Coefficient shows 

segregated staining. 



Pearson's Correlation is high. 

Overlap Coefficient is showing high colocalization. 

Ch.Red/Ch. Green ratio is not valid. 

Colocalization Coefficient for Red Channel shows 

strong colocalization. 

Colocalization Coefficient for Green Channel shows 

strong colocalization. 

Intensity Correlation Quotient Coefficient shows 

segregated staining. 



Step 2. Feature extraction. Features proposed in Section 3 are extracted 
from 8(711,712) to form a row vector F — [rp,r, [fci,^], [Mi, M%] , ICQ] . 

Step 3. Fuzzification. Features vector F is fuzzified according to the lin- 
guistic variable associated to each feature. Follows to construct a row vector 

F = \fp, r, l&i.fal , \Mi,M 2 } , ICQ] 
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Table 4. Fuzzy predicates obtained from the A341 cells database 




Pearson's Correlation is high. 

Overlap Coefficient is showing high colocalization. 

Ch.Red/Ch. Green ratio is not valid. 

Colocalization Coefficient for Red Channel shows strong colocalization. 

Colocalization Coefficient for Green Channel shows strong colocalization. 

Intensity Correlation Quotient Coefficient shows segregated staining. 

Pearson's Correlation is strong. 

Overlap Coefficient is showing high colocalization. 

Ch.Red/Ch.Green ratio is not valid. 

Colocalization Coefficient for Red Channel shows strong colocalization. 

Colocalization Coefficient for Green Channel shows strong colocalization. 

Intensity Correlation Quotient Coefficient shows segregated staining. 

Pearson's Correlation is high. 

Overlap Coefficient is showing high colocalization. 

Ch.Red/Ch.Green ratio is not valid. 

Colocalization Coefficient for Red Channel shows strong colocalization. 

Colocalization Coefficient for Green Channel shows strong colocalization. 

Intensity Correlation Quotient Coefficient shows segregated staining. 







Pearson's Correlation is high. 

Overlap Coefficient is showing high colocalization. 

Ch.Red/Ch.Green ratio is not valid. 

Colocalization Coefficient for Red Channel shows strong colocalization. 

Colocalization Coefficient for Green Channel shows strong colocalization. 

Intensity Correlation Quotient Coefficient shows segregated staining. 

Pearson's Correlation is low. 

Overlap Coefficient is showing random colocalization. 

Ch.Red/Ch.Green ratio is not valid. 

Colocalization Coefficient for Red Channel shows strong colocalization. 

Colocalization Coefficient for Green Channel shows strong colocalization. 

Intensity Correlation Quotient Coefficient shows segregated staining. 

Pearson's Correlation is low. 

Overlap Coefficient is showing random colocalization. 

Ch.Red/Ch.Green ratio is not valid. 

Colocalization Coefficient for Red Channel shows strong colocalization. 

Colocalization Coefficient for Green Channel shows strong colocalization. 

Intensity Correlation Quotient Coefficient shows segregated staining. 



Step 4- Fuzzy predicates. For each element of the fuzzyfied features vector 
F the linguistic term T{x) is extracted according to the semantic rules M 
associated to each linguistic variable. Then, fuzzy predicates are constructed 
according to the skeleton shown in Table [5J 

Let us consider the case of the samples shown in Figures [1] El 01 and El 
The fuzzy predicates obtained from these cases are presented in Table El 
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Clearly, fuzzy predicates alleviate the need for quantitative results in natural 
language. Consider the experiments with the A341 cells database shown in 
Table 01 These cases are also successful experiments in the construction of 
colocalization fuzzy predicates. 

The results shown represent a paradigm change in the quantitative colo- 
calization analysis since current problems are solved visually and by hu- 
man analysis. In this paper is proposed the usage of features and fuzzy 
linguistic variables to provide an invariant human-independent result of 
colocalization. Results provide a descriptive set of colocalization coefficient 
interpretations that could lead researchers to a uniform and consistent 
interpretation colocalization. 

5 Conclusion 

In this document we addressed the subjectivity problem in quantitative 
colocalization analysis trough fluorescence microscopy. We presented the most 
common quantitative colocalization coefficients reported in literature. These 
coefficients are utilized to model a fuzzy system. Linguistic variables are mod- 
eled using fuzzy logic theory from the reported colocalization coefficients. Fuzzy 
membership functions were constructed based on literature review. The pro- 
posed fuzzy model provides natural language quantitative colocalization 
results through the evaluation of membership functions associated with lin- 
guistic variables. Our model demonstrated successful assessment of quantita- 
tive colocalization with natural language results. Results over the Yeast and 
A431 cells database show consistency in the computations and in the quan- 
titative colocalization assessment. The proposed model can be utilized in the 
field of biological sciences where confocal fluorescence microscopy techniques 
are used in the analysis of protein-to-protein interactions. This will alleviate 
the paucity of formal quantitative colocalization analysis using novel compu- 
tational intelligence methods. The proposed model provides a human-like en- 
semble of colocalization coefficient interpretations that could lead researchers 
to a uniform and consistent interpretation colocalization. 
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Abstract. In this article we approach the linear ordering problem with cu- 
mulative costs (LOPCC). Bertacco developed this problem jl] and propose 
two exact algorithms, due to the complexity of the problem Duarte propose 
a Tabu algorithm for LOPCC [3] and until now that algorithm is the state of 
the art. In this ongoing research we propose a set of iterated local search 
algorithms (ILS) to solve the LOPCC. The experimental evidence shows 
that the performance of the iterated local search algorithms evaluated have 
similar quality to the best solution reported and get better efficiency than 
the reference solution. Also with the local search algorithms we improve the 
best known solution for 32 instances. Now we are working in developing new 
algorithms with population metaheuristics. 

1 Introduction 

In wireless communication systems it is required that the cell phones com- 
municate constantly with some base station. In order for the base station 
to distinguish between different signals from the cell phones, the Univer- 
sal Mobile Telecommunication Standard (UMTS) adopted the technique of 
Code Division for Multiple Access (CDMA) in this technique each cell phone 
have a different code. Due to the induced distortion because of radio waves 
propagation, each cell phone interferes with each other, so in order to preserve 
the connectivity it is necessary to keep the interference below an acceptable 
level. 

In UMTS exists a technique to reduce the interference called, Successive 
Interference Cancellation (SIC). In this technique each cell phone emits a 
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signal, on a sequential order previously established, and that signal is detected 
by the base station. After each detection, the interference is eliminated for 
the other cell pones, improving the detection for the other users. 

One of the main problems of the SIC technique is the order of detection for 
each cell phone. Usually the users are ordered given their emission level in de- 
crease order, however we could have a better performance considering the inter- 
ference between users. A second problem is the selection of the levels of energy 
a,i at which the cell phone i have to transmit its data. A cell phone that trans- 
mit its data using a high level of energy to transmit its data would be easily 
detected and a cell phone that uses a low level of energy would have a longer 
duration of its battery. Besides, there exists physical an regulation restrictions 
that limit the energy of transmission of any cell phone to a value U . 

Then one faces the problem of finding the order of detection for the users 
on a SIC technique and find the minimal power levels of transmission for each 
cell phone, that assures a proper reception for all the users. This problem is 
called Joint Power Control and Receiver Optimization (JOPCO) and was 
formulated by Benvenuto |1L Using this problem Bertacco formulates the 
Linear Ordering Problem With Cumulative Costs (LOPCC) 0. 

2 Linear Ordering Problem with Cumulative 
Costs(LOPCC) 

Given a complete digraph G = (V, A) with no negative weighted nodes c uv , 
the Linear Ordering Problem With cumulative Costs (LOPCC) consists in 
finding a permutation P = {pi,p2, ■■■,Pn-i,Pn} of nodes, which minimize the 
next objective function. 

l 
LOPCC(P) = Y^ a Pi 



Where: 

Oir, 



— °>Pi T / j \ c PiPj a Pj ) 



Under this context a Pi represents the energy level of the signal coming from 
the cell phone pi . 



3 Related Work 

3.1 Linear Ordering Problem (LOP) 

The linear ordering problem (LOP) is close related with the linear ordering 
problem with cumulative costs, in this section we review some recent works 
related to LOP. 
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Laguna in [4) propose a Tabu search metaheuristic for the linear 
ordering problem. This approach uses a insertion neighborhood along with 
a local search based on a first strategy. Laguna implements intensification 
and diversification besides the ones inherent to tabu search. The additional 
intensification is produced with a pathrelinking algorithm that is applied to 
a set of elite solutions. The additional diversification uses long term mem- 
ory to insert certain elements into a complementary position related to the 
previously selected positions. Laguna propose a balance in the intensification 
and diversification methods so that an algorithm should not waste time in 
methods that does not produce results, this observation could be applied to 
any algorithm that have more than one method. 

Schiavinotto in [S] approach the linear ordering problem (LOP) using the 
insertion neighborhood. In his work he propose insertion movements by us- 
ing consecutive swap movements. Schiavinotto uses two metaheuristics al- 
gorithms to solve the LOP: an Iterated Local Search (ILS) and a Memethic 
Algorithm (MA). The local search to both methods is LSf which is a method 
that stops some where between first and best improvements. The memethic 
algorithm does not uses mutation due to its inherent diversity. His reported 
experiments shows that the Menretic Algorithm have a better performance, in 
time and deviation from the optimum, than the Iterated Local Search (ILS). 
This results were validated using a Wilcoxon test with a = 0.01. However 
there were not enough evidence that the MA had better performance than 
ILS for instances of size 250. A very important element of Schiavinotto's work 
is the economical way to recalculate the objective function value, and that is 
possible because of the order in which the neighborhood is visited. 

3.2 Linear Ordering Problem with Cumulative Costs (LOPCC) 

Now we review the works directly related to the Linear Ordering Problem 
with Cumulative Costs (LOPCC). Bertacco in [5] propose a problem formu- 
lation and two exact algorithms to solve LOPCC. One of them is based on 
Mixed Integer-Linear programming, while the other is an enumerative ad-hoc 
algorithm that produce the best results. 

Righini in [5] propose a polynomially computable lower bound algorithm 
that can be embedded in a branch-and-bound algorithm, also he presents a 
constructive heuristic based on the truncation of the resulting branch-and- 
bound algorithm. Righini reports the branch-and-bound algorithm with lower 
bounds on instances of 16 elements, as a result he shows an average time of 
0.96 and a error deviation from the optimal of 52%. 

Duarte in [3] propose a tabu search algorithm to solve the LOPCC, similar 
to the algorithm proposed by Laguna on [?]. In this work it is an initial 
construction based on a greedy algorithm and a first local search on an 
insertion neighborhood. Duarte reports results on instances of size 150. 

The intensification phase uses a candidate list based on a score calculated 
as follows, view Eq. ([1]). In this way it is given priority to those elements 
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that have a lesser score and doing that it is expected that the intensification 
produces lower values of the objective function. 

score(i) = \^ Cijdj + \^ Cjid%. (1) 

3 3 

The diversification method uses a long term memory, which allows the algo- 
rithm to select an element to be inserted in other position. The probability of 
selecting an element is inversely proportional to the quantity of times that the 
item was selected in the phase of intensification. The reported experiments 
were done with 25 UMTS instances of each scenario; where the scenarios are 
instances synchronous and asynchronous, with and without scrambling. Also 
there were 49 LOLIB instances and 75 Random instances with a uniform 
distribution between and 100 of different sizes, 25 instances of size 35, 25 
of size 100, and 25 of size 150. A comparative study was done against the 
enumerative method proposed by Bertacco j2], a Random Greedy method 
proposed by Benvenuto [lj and an adaptation of the Tabu search proposed 
by laguna g]. 

We begins our research with a preliminary evaluation of the algorithm pro- 
posed by Duarte to identify possible improving areas. Then an Iterated Local 
Search approach was used to evaluate strategies to improve the reported results. 

4 Solution Proposal 

4.1 Iterated Local Search (ILS) 

In this section we describe the Iterated Local Search algorithms proposed. 
The main methods used on the algorithms are the initial solution construc- 
tion, the neighborhood, the local search and the solution perturbation. 

The initial solution construction is made by a greedy algorithm Greedy- 
HeuristicConstruction(). It generates 10 solutions which are built in inverse 
order. For each one, the last element of the permutation is randomly selected 
and from there the elements are selected using a roulette that gives preference 
to those elements that produces the lower cost if they are placed at the next 
position of the permutation. Once the new element is placed some where, 
the algorithm verifies if that element could produce a better solution if its 
inserted on another position that was already taken by other selected ele- 
ment. This construction implies checking the costs of the objective function 
for each element that could be on the permutation at the next position. This 
is an expensive process but permits that the algorithm produces good quality 
solutions. 

Other initial constructions used consists of selecting in a deterministic 
way an initial solution. This construction is based on the score proposed by 
Duarte, where the elements with a lower score would be placed on the lasts 
positions of the permutation, this algorithm is named as ILS_C. 
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All the ILS algorithms uses the insertion neighborhood. This neighborhood 
takes an element and it is inserted into another position of the permutation, 
preferably on a position that produces a better objective function value. 

The local search tries to place every element of the permutation somewhere 
else that gives a better objective function value. Given an element the position 
where it must be inserted is determined reviewing the neighborhood from the 
element position i to the left and continuing from element position i to the 
right, making swap moves. 

In the solution perturbation phase it is selected a random element and is 
placed in some other position. It is intended that the new position improves 
the actual solution, however if that was not possible then it is selected the 
best none-improving position. 

The local search method is used to move the actual solution to an optimal 
local solution. It is applied after the initial solution construction and the so- 
lution perturbations. The algorithm that uses the GreedyHeuristicConstruc- 
tionQ and a random perturbation perturbation is named ILS_B. The general 
structure of the ILS algorithms is the next, see fig.([T]): 



p = GreedyHeuristicConstruction() 
p = LocalS earch(p) 

Pbetter = P 

While globallterations without improve < maxlterations 

{ 

p = Perturbation (p) 
p = LocalS earch(p) 

if (p < Pbetter) 

{ 

Pbetter = P 
} 

else 
{ 

if (p > Pbetter * (3) 
{ 

P = Pbetter 
} 
} 
} 



Where: 

13 is a numerical value between 1 and 2. 

Fig. 1. General Structure of the ILS algorithm 
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5 Experimental Results 

All the algorithms were implemented in C and all the experiments were made 
on a computer with a Xenon 3.06 GHz processor and 4 GB of RAM. The 
instances used on the experiments were the same as those used by Duarte in 
®. 

Because the reference algorithm is the one proposed by Duarte and it uses 
one seed for the random numbers and a time limit of 60 seconds, we did our 
experiments on the ILS algorithms with the same parameters. 

The tables [U 013] and H] show the results obtained by the UMTS instances 
for which are known the optimal values. This results shows that the ILS_B 
algorithm is slightly worse than the TS_Duarte algorithm in quality. But 
in efficiency the ILS_B is better than the reference algorithm. The ILS_B 
algorithm outperforms the average efficiency of the TS_Duarte by 73%. 

Table 1. 25 UMTS instances synchronous and without scramble of size 16 



I TS Duarte ILS B ILS C 



Objective Fun. 


6.24 


6.24 


6.28 


Avg. Error 


0% 


0% 


0.76% 


Standard Dev. 


0% 


0% 


2.13% 


Number of opt. 


25 


25 


18 


CPU Seconds 


0.06 


0.01 


0.01 



Table 2. 25 UMTS instances synchronous and with scramble of size 16 



TS Duarte ILS B ILS C 



Objective Fun. 


14 


14.04 14.02 


Avg. Error 


0.03% 


0.34% 0.12% 


Standard Dev. 


0.2% 


1.02% 0.35% 


Number of Opt. 


24 


22 22 


CPU seconds 


0.03 


0.01 0.01 



Table 3. 25 UMTS instances asynchronous and without scramble of size 16 



TS Duarte ILS B ILS CI 



Objective Fun. 


12.43 


12.43 12.79 


Avg. Error 


0.03% 


0.39% 2.57% 


Standard Dev. 


0.1% 


1.87% 5.36% 


Number of Opt. 


24 


21 16 


CPU seconds 


0.03 


0.01 0.01 
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Table 4. 25 UMTS instances asynchronous and with scramble of size 16 



TS Duarte ILS B ILS C 



Objective Fun. 


19.05 


19.31 19.20 


Avg. Error 


0.06% 


0.82% 1.46% 


Standard Dev. 


0.3% 


2.14% 3.93% 


Number of Opt. 


24 


19 19 


CPU seconds 


0.03 


0.01 0.01 



The table [5] shows the experimental results on the LOLIB instances for 
which no optimal solutions are known. This table is shows that the ILS_B 
algorithm finds new best known solutions for 29 instances and outperforms 
the average Error of the TS_Duarte by 38.42% and the efficiency by 48.92%. 

Table 5. 49 LOLIB instances of sizes 44(28), 50(2), 56(11) y 60(3) 



||TS Duarte ILS B ILS C 



Objective Fun. 


19650.88 


19949.41 


373356.87 


Avg. Error 


56.38% 


17.96% 


139.75% 


Standard Dev. 


262.03% 


59.76% 


399.46% 


Best known 


30 


29 


19 


CPU seconds 


2.31 


1.18 


1.57 



The tables [7] and |S] show the results of the random instances for 35, 100 
and 150 elements for which no optimal solutions are known. The ILS algo- 
rithms were able to find new best known solutions for 20 random instances, 
but at the same time the average error grows with the size of the instances. 

Table 6. 25 random instances of size 35 



TS Duarte ILS B 



ILS C 



Objective Fun. 


0.34132 


0.34444 


0.37812 


Avg. Error 


0.36% 


1.76% 


4.8% 


Standard Dev. 


0.99% 


5.84% 


7.79% 


Best known 


20 


15 


11 


CPU seconds 


0.39 


0.32 


0.35 



On the randonr instances of size 35 the ILS_B algorithm finds 3 new 
best known solutions, while the ILS_C algorithm finds 4 new best known 
solutions. The average efficiency of the ILS algorithms improves by 14% the 
average efficiency of the reference algorithm. 

On the random instances of size 100 the ILS_B algorithm finds 7 new best 
known solutions and the ILS_C algorithm finds 1 new best known solutions. 



402 



J.D. Teran Villanuova et al. 



Table 7. 25 random instances of size 100 



TS Duarte ILS B 



ILS C 



Objective Fun. 


1197.2316 


1321.82744 


1676.55224 


Avg. Error 


2.667% 


9.528% 


25.683% 


Standard Dev. 


7.093% 


9.064% 


33.559% 


Best known 


17 


7 


1 


CPU seconds 


30.75 


32.78 


26.56 



Table 8. 25 random instances of size 150 



TS Duarte ILS B 



ILS C 



Objective Fun. 


2250558.454 


3020078.359 


490485.4494 


Avg. Error 


3.784% 


14.638% 


44.956% 


Standard Dev. 


7.737% 


15.638% 


30.307% 


Best known 


18 


6 


1 


CPU seconds 


180.43 


73.25 


81.84 



Table 9. The best known solutions updated for the LOLIB instances 



Instance 



Value Instance 



Value Instance 



Value 



be75eec 


5.09 


t65nllxx 


2.09 


t75illxx 


4455.05 


be75np 


16543623.61 


t65wllxx 


19.26 


t75kllxx 


1.32 


be75oi 


3.07 


t69rllxx 


14.04 


t75nllxx 


9.9 


be75tot 


297138.99 


t70bllxx 


93.67 


t75ullxxa 


0.068427 


stabu70 


14.86 


t70dllxx 


79.74 


tiw56n54 


2.64 


stabu74 


14.03 


t70dllxxb 


4.44 


tiw56n58 


3.62 


stabu75 


9.42 


t70fllxx 


1.27 


tiw56n62 


3.02 


t59bllxx 


76262.11 


t70illxx 


121146.03 


tiw56n66 


2.69 


t59dllxx 


4086.33 


t70kllxx 


0.5 


tiw56n67 


1.88 


t59fllxx 


24.14 


t70111xx 


798.92 


tiw56n72 


1.57 


t59illxx 


621682.25 


t70nllxx 


0.05 


tiw56r54 


2.63 


t59nllxx 


1618.9 


t70ullxx 


35490330260 


tiw56r58 


3.6 


t65bllxx 


28230.62 


t70wllxx 


0.04 


tiw56r66 


2.19 


t65dllxx 


3898.61 


t70xllxx 


0.23 


tiw56r67 


1.54 


t65fllxx 


1.25 


t74dllxx 


4.76 


tiw56r72 


1.35 


t65illxx 


826127.4 


t75dllxx 


5.06 






t65111xx 


2657.72 


t75ellxx 


2063.41 
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Table 10. The best known solutions updated for the aleatory instances 



Instance 


Value Instance 


Value Instance 


Value 




tld35.1 


0.931 


tldlOO.l 


263.066 


tldl50.1 


9652.803 


tld35.2 


0.167 


tldl00.2 


297.27 


tldl50.2 


216751.9 


tld35.3 


0.154 


tldl00.3 


1431.21 


tldl50.3 


772872.52 


tld35.4 


0.196 


tldl00.4 


8864.08 


tldl50.4 


88416.64 


tld35.5 


1.394 


tldl00.5 


172.11 


tldl50.5 


87690.014 


tld35.6 


0.2 


tldl00.6 


479.3 


tldl50.6 


58579.73 


tld35.7 


0.12 


tldl00.7 


6638.95 


tldl50.7 


172627.38 


tld35.8 


0.226 


tldl00.8 


3095.19 


tldl50.8 


353334.04 


tld35.9 


0.436 


tldl00.9 


75.193 


tldl50.9 


403646.64 


tld35.10 


0.205 


tldlOO.10 


174.44 


tldl50.10 


146736.66 


tld35.11 


0.369 


tldlOO.ll 


256.74 


tldl50.11 


13900.04 


tld35.12 


0.235 


tldl00.12 


246.92 


tldl50.12 


82560.127 


tld35.13 


0.196 


tldl00.13 


629.248 


tldl50.13 


102720.712 


tld35.14 


0.138 


tldl00.14 


279.38 


tldl50.14 


95393.684 


tld35.15 


1.376 


tldl00.15 


458.02 


tldl50.15 


352880.29 


tld35.16 


0.287 


tldl00.16 


771.94 


tldl50.16 


24497326.54 


tld35.17 


0.2 


tldl00.17 


783.01 


tldl50.17 


98564.17 


tld35.18 


0.384 


tldl00.18 


704.937 


tldl50.18 


767813.83 


tld35.19 


0.236 


tldl00.19 


249.29 


tldl50.19 


96627.16 


tld35.20 


0.068 


tldl00.20 


272.82 


tldl50.20 


2279010.02 


tld35.21 


0.202 


tldl00.21 


232.05 


tldl50.21 


48116.464 


tld35.22 


0.177 


tldl00.22 


176.64 


tldl50.22 


866394.89 


tld35.23 


0.346 


tldl00.23 


1724.49 


tldl50.23 


23972543.85 


tld35.24 


0.132 


tldl00.24 


535.28 


tldl50.24 


119426.11 


tld35.25 


0.143 


tldl00.25 


698.71 


tldl50.25 


462316.51 



The average efficiency of the ILS algorithms improves by 4% the average 
efficiency of the reference algorithm. 

On the instances of size 150 even though the TS_Duarte is fixed to 60 
seconds it takes 180 seconds to solve the instances, that is one of the reasons 
for its better quality results. Nevertheless the ILS_B algorithm finds 6 new 
best known solutions and the ILS_C algorithm finds 1 new best known so- 
lutions. The average efficiency of the ILS algorithms for random instances of 
size 150 improves by 57% the TS_Duarte algorithm. 

6 Conclusions 



This ongoing investigation, studies the lineal ordering problem with cumu- 
lative costs and uses Iterative Local Search (ILS) algorithms. Besides that 
the ILS algorithms proposed in this work are still under development and 
need tuning, they tend to work as well for small instances as the state of 
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art algorithms and get even better efficiency. Meanwhile for bigger instances, 
their quality is reduced but at the same time some algorithms find new best 
known solutions than the ones reported by Duarte. We show an update to 
the best known solutions reported by Duarte in (3], we remark in bold the 
new best known solutions found in this work, see tables [5] and If 01 
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Abstract. This paper addresses the analysis and design of an observer in order 
to ease the difficulty of working with various variables for the design of type-1 
fuzzy controllers. In this paper Fuzzy Lyapunov Synthesis, based on the observed 
system, is extended to the design of type- 1 fuzzy logic controllers for nonsmooth 
mechanical systems. The output regulation problem for a servomechanism with non- 
linear backlash is proposed as a case of study. The problem at hand is to design a 
feedback controller so as to obtain the closed-loop system in which all trajecto- 
ries are bounded and the load of the driver is regulated to a desired position while 
also attenuating the influence of external disturbances. The servomotor position is 
the only measurement available for feedback; the proposed extension is far from 
trivial because of the nonminimum phase properties of the system. 

1 Introduction 

A major problem in control engineering is a robust feedback design that asymptot- 
ically stabilizes a plant while also attenuating the influence of parameter variations 
and external disturbances. In the last decade, this problem was heavily studied and 
considerable research efforts have resulted in the development of systematic design 
methodologies for nonlinear feedback systems. A survey of these methods, fun- 
damental in this respect is given in [13|. A novel approach to the modeling and 
identification of elastic robot joints with hysteresis and backlash is reported in l23l . 

O. Castillo et al. (Eds.): Soft Computing for Intell. Control and Mob. Robot., SCI 318, pp. 405- |42lJ 
springerlink.com © Springer- Verlag Berlin Heidelberg 2010 
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The design of fuzzy logic systems is a heavy task that fuzzy logic practitioners 
face every time that they try to use fuzzy logic as a solution to some problem, the 
design of fuzzy logic systems implies at least two stages: design of the rule-base 
and design of the membership functions (MFs). 

There has been some publications in the design of type-1 fuzzy logic systems 
(T1FLS), for example ifTTj presents Genetic Algorithms (GA) as an optimization 
method of control parameters, the authors optimize parameters of the closed-loop 
system but not of the T1FLS. In 1 1 6 1 GAs are used to optimize all the parameters of 
a T1FLS. In [ 18] a hybridizing of Neural Networks and GAs is presented to optimize 
a TIFLSs. A Hierarchical Genetic Algorithms is proposed in [4| to optimize rules 
and MFs parameters of a T1FLS. 

The use of fuzzy systems in the control of mechanisms with backlash is a topic 
that has been treated in recent years, for example in ifTOll an energy-compensated 
fuzzy swing-up and balance control is investigated for the planetary-gear-type in- 
verted pendulum. In the present paper the output regulation problem is studied for an 
electrical actuator consisting of a motor part driven by a DC motor and a reducer part 
(load) operating under uncertainty conditions in the presence of nonlinear backlash 
effects. The objective is to drive the load to a desired position while providing the 
roundedness of the system motion and attenuating external disturbances. Because 
of practical requirements (see e.g.,[ 15]), the motor's angular position is assumed to 
be the only information available for feedback. 

This problem was first reported in 1 1 ], where the problem of controlling nonmin- 
imum phase systems was solved by using nonlinear H^ control, which not does pro- 
vides robustness evidence. In [9], authors report a solution to the regulation problem 
using T1FLS as the controller. In [6] and |5], authors report solutions using T2FLS 
controllers, and making a genetic optimization of the membership function's pa- 
rameters, but do not specify the criteria used in the optimization process and GA 
design. In [7], a comparison of the use of GAs to optimize type-1 and type-2 FLS 
controllers is reported, but a method to achieve this optimization is not provided, 
but in [8] the authors provides a method to achieve this optimization. 

The solution that is proposed in this paper is to design an observer for the math- 
ematical model of our case of study in order to minimize the number of variables 
involved in the system and to make the design of type-1 FLSs (Fuzzy Logic Con- 
trollers -FLCs-) extending the Fuzzy Lyapunov Synthesis ifTTI , a concept that is 
based on the Computing with Words [ 19]|24| approach of the Lyapunov Synthesis 
fl4l . this approach was reported in Q and J2| for the design of stable FLCs. 

The paper is organized as follows. The dynamic model of the nonminimum phase 
servomechanism with nonlinear backlash is presented in Section [2] Section [3] ad- 
dresses the problem statement. Section [4] discusses the observer design. The de- 
sign of fuzzy logic controllers using the Fuzzy Lyapunov Synthesis is presented in 
Section [5J Experimental results for the designed FLSs are presented in Section [6] 
Concluding remarks are collected in Section[7] 
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2 Dynamic Model 

The dynamic model of the angular position qi(t) of the DC motor and the q (t ) of 
the load are given according to 

J N- l q + f N- l q = T + Wo 
Mi + Mi + T = %m + Wi, 

hereafter, Jq, /o, qo an d qo are, respectively, the inertia of the load and the reducer, 
the viscous output friction, the output acceleration, and the output velocity. The in- 
ertia of the motor, the viscous motor friction, the motor acceleration, and the motor 
velocity are denoted by /,-, ft, q\ and qt, respectively. The input torque T„, serves as 
a control action, and T stands for the transmitted torque. The external disturbances 
Wi(t), wo(t) have been introduced into the driver equation (Q} to account for desta- 
bilizing model discrepancies due to hard-to-model nonlinear phenomena, such as 
friction and backlash. 

The transmitted torque T through a backlash with an amplitude j is typically 
modeled by a dead-zone characteristic [21, p. 7]: 

f \Aq\ < j 

T{Aq) = { ' (2) 

I KAq — Kjsign(Aq) otherwise 

with 

Aq = qi-Nq , (3) 

where K is the stiffness, and N is the reducer ratio. Such a model is depicted in 
Fig. 03 Provided the servomotor position qi(t) is the only available measurement 
on the system, the above model (OH)-© appears to be non-minimum phase because 
along with the origin the unforced system possesses a multivalued set of equilibria 
(qt,qo) with q t = and q E [-j,j\. 

To avoid dealing with a non-minimum phase system, we replace the backlash 
model (0 with its monotonic approximation: 

T = KAq-Kr\{Aq) (4) 

where 



l-exp{-f} 

-2j ^-f|. (5) 

l+exp{-f} 



The present backlash approximation is inspired from 12011 . Coupled to the drive 
system (Q]) subject to motor position measurements, it is subsequently shown to 
continue a minimum phase approximation of the underlying servomotor, operat- 
ing under uncertainties W{(t), Wo(t) to be attenuated. As a matter of fact, these un- 
certainties involve discrepancies between the physical backlash model (|2| and its 
approximation (O and (0. 
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-j 




/ 


x^" 


j 


Aq 
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(b) 

Fig. 1. a) The dead-zone model of backlash; b) The monotonic approximation of the 
dead-zone model 

3 Problem Statement 



To formally state the problem, let us introduce the state deviation vector x 

[xi,X2,X-},X4] with 
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x\ = qo-qd, 

X2 = 4o, 

x 3 = qi-Nq d , 

X4 = qi, 

where x\ is the load position error, x 2 is the load velocity, X3 is the motor position 
deviation from its nominal value, and X4 is the motor velocity. The nominal motor 
position Nq c i has been pre-specified in such a way to guarantee that Aq = Ax, where 

Ax = X3 — Nx\ . 

Then, system (HJ-©, represented in terms of the deviation vector x, takes the form 



(6) 



(7) 



x\ =x 2 , 

x 2 = J ' [KNx 3 - KN 2 x x - f Q x 2 + KNr) (Aq) + w ] , 

XT, = X4, 

x 4 = Jj~ l [t„, + KNx\ - Kxj, - fix 4 + Kf] (Aq) + w { ] . 

The zero dynamics 

x\ = x 2 , 

x 2 = J l [-KN 2 X1 - f x 2 + KNT](-Nxi )} , 

of the undisturbed version of system (0 with respect to the output 

y = x 3 (8) 

is formally obtained (see [121 for details) by specifying the control law that main- 
tains the output identically to zero. The following result, extracted from ll22l . 
guarantees that the error system (|6]) and ([8]) is globally minimum phase. 

Theorem 1. Let the nonlinearity r\ be governed by (0. Then system is globally 
asymptotically stable. 

Proof. To begin with, let us consider a Lyapunov function of the form 

V(xi,x 2 ) = \xl + ±KN 2 J l x 2 



2 A 2 ' 2 JVJV J A \ 

e^l/j + l _ Nx£\ ( 9 ) 

2 j 



+2j 2 KJ- l (l\n'- 



Since 



e Nx 1 /j+l + l N 

21n > — - 

2 " j 
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for all x\ 6 R, by inspection, the function V, governed by (O, appears to be positive 
definite. 

Then, let us compute the time derivative of the Lyapunov function on the 
trajectories of © 

V = -J l (KN 2 Xl +ftf 2 + 2jKNf^)x2 
+KN 2 J Q l xix 2 

+2j KJ [ij^+lf 2 ' J ) 

= -J l f X 2 <0. 

Now let us observe that the zero-dynamics system (0) has no trivial solutions on the 
manifold X2 = 0, where the time derivative of the Lyapunov function equals to zero. 
Indeed, if X2 — 0, then due to © 

Nxi l-^H 

— - + 2 rrv- = (11) 

thus concluding that x\ = 0. To reproduce the latter conclusion, it suffices to 
represent (TTTb in terms of v = Nx\ /j 

\-e v 
v + 2 TT - 7 ^0 (12) 

and note that the left-hand side of (fT2l is a strictly increasing function of v because 
its derivative is positive definite by inspection 

4e v (l-e v ) 2 

for all v ^ 0. 

In order to complete the proof, it remains to apply the LaSalle-Krasovski 
invariance principle to the system in question. 

The objective of the Fuzzy Control output regulation of the nonlinear driver sys- 
tem (1) with backlash © and ([5]), is thus to design a Fuzzy Controller so as to 
obtain the closed-loop system in which all these trajectories are bounded and the 
output qo(t) asymptotically decays to a desired position q c / as t — * °° while also 
attenuating the influence of the external disturbances w,(?) and Wo(t). 



4 Observer Design 

Whereas the design of fuzzy controllers is a task that increases the degree of 
difficulty as the number of variables increases, to reduce the number of system 
variables ©, we propose the following observer: 
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X\ =X2, 



-W- ^2, ,_- , nr-tJT-S ....! d4) 



X 2 = Jo[KNx 3 - KN 2 x Y - fox~ 2 + ffJV?] (4g) + w„] 



where 



4# = z\x = x3 — AftTi. (15) 

The observer JT4l)-(fT5l) reduces to two the original four variables of ©. To verify 
that (TBI) behaves as (O we define the error variables: 



e\ = e 2 

e 2 =J l [-KN 2 ei -f e 2 +KNri{Aq) 



(16) 



Then, the following candidate Lyapunov function is proposed 

\e 2 + \KN 2 J Q l e 2 



V(e u e 2 ) = \e 2 2 + \KN 2 jQ l e 2 



+2j 2 KJ l ( 



2 

whose time derivative is 



2\ n e^lll±l _^£l\ ( 17 ) 



V = -J Q l (KN 2 e x +f oe2 + 2jKNf^) e 2 
+KN 2 J l e { e 2 

I Tj2r r l ( 4Ne Ne </j Ne 2 \ 

+2j KJ o [ 2 j(^/j+i) e2 ~~r) 
= -J Q l h4 < o. 



(18) 



In order to complete the proof, it remains to apply the LaSalle-Krasovski invariance 
principle to the system in question, concluding asymptotic stability. 

5 Design of the Fuzzy Logic Controllers 

To apply the Fuzzy Lyapunov Synthesis, we assume the following: 

1. The system has really two degrees of freedom referred to as x\ and x 2 , 
respectively. Hence by <fT4b . x\ = x 2 . 

2. The states x\ and x 2 are the observed variables. 

3. The exact equations (Q}-© are not necessarily known. 

4. The angular acceleration x 2 is proportional to x,„, that is, when % m increases 
(decreases) x 2 increases (decreases). 

5. The initial conditions x(0) = (xi(0),x 2 (0)) T belong to the set N ={ieR 2 :j| 
x — x* || < e} where x* is the equilibrium point and x = [x\ ,x 2 ] . 

The control objective is to design the rule-base as a fuzzy controller x m = T m (xi,x 2 ) 
to stabilize the system Q])-©. 

Theorem |2] that follows establish conditions that help in the design of the fuzzy 
controller to ensure asymptotic stability. The proof can be found in 1 14]. 
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Theorem 2 (Asymptotic stability 1 14|). Consider the nonlinear system (E])-© with 
an equilibrium point at the origin, i.e., f(0)=(0), and let x £ N, then the ori- 
gin is asymptotically stable if there exists a scalar Lyapunov function V(x), with 
continuous partial derivatives such that 

• V (x) is positive definite 

• V(x) is negative definite. 

The fuzzy controller design proceeds as follows. Let us introduce the Lyapunov 
function candidate 

V(x l ,x 2 ) = ^(xi 2 + x 2 2 ), (19) 

which is positive-definite and radially unbounded function. The time derivative of 
V{x\,x~2) results in: 

V (fi ,X 2 ) = X\ X\ + X 2 X 2 — X\ X 2 + X 2 X 2 , (20) 

To guarantee stability of the equilibrium point (x^x^V — (0j0) r we wish to have: 

X\X~2 + X 2 X 2 < 0. (21) 

We can now derive sufficient conditions so that inequality (f2TT> holds: If x\ and x 2 
have opposite signs, then x\x 2 < and (f2TT> will hold if x 2 = 0; if x\ and x 2 are both 
positive, then (f2TT> will hold if x 2 < —x\ ; if x\ and x 2 are both negative, then d2TT) 
will hold if x~2 > —x~\ . 

We can translate these conditions into the following fuzzy rules: 

• If x\ is positive and x 2 is positive then x 2 must be negative big. 

• If x\ is negative and x 2 is negative then x 2 must be positive big. 

• If x\ is positive and x 2 is negative then x 2 must be zero. 

• If x\ is negative and x 2 is positive then x 2 must be zero. 

However, using our knowledge that x 2 is proportional to u, we can replace each x 2 
with u to obtain the following fuzzy rule-base for the stabilizing controller: 



If x\ is positive and x 2 is positive then u must be negative big. 
If xi is negative and £2 is negative then m must be positive big. 
\fx\ is positive and ^ is negative then m must be zero. 



• If xi is negative and £2 is positive then m must be zero. 

This fuzzy rule-base can be represented as in Table Q] 

It is interesting to note that the fuzzy partitions forxi, x 2 , and u follow elegantly 
from expression (120b . Because V = f 2 (x\ +x 2 ), and since we require that V be 
negative, it is natural to examine the signs of xi and x 2 ; hence, the obvious fuzzy 
partition is positive, negative. The partition for x 2 , namely negative big, zero, pos- 
itive big is obtained similarly when we plug the linguistic values positive, negative 
for x\ and x 2 in (l20l . To ensure that x 2 < —x\ (x 2 > —x~i) is satisfied even though 
we do not know x\ 's exact magnitude, only that it is positive {negative), we must set 
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Table 1. Fuzzy IF-THEN rules 



No. error change of error control 



1 


positive 


positive 


negative big 


2 


negative 


negative 


positive big 


3 


positive 


negative 


zero 


4 


negative 


positive 


zero 



X2 to negative big (positive big). Obviously, it is also possible to start with a given, 
pre-defined, partition for the variables and then plug each value in the expression 
for V to find the rules. Nevertheless, regardless of what comes first, we see that 
fuzzy Lyapunov synthesis transforms classical Lyapunov synthesis from the world 
of exact mathematical quantities to the world of computing with words lT24ll . lfT9ll . 

To complete the controller's design, we must model the linguistic terms in the 
rule-base using fuzzy membership functions and determine an inference method. 
Following [3J, we characterize the linguistic terms positive, negative, negative big, 
zero and positive big. The type-1 MFs are depicted in Fig. [2] for a type-1 fuzzy 
logic controller. To this end, we had systematically designed a FLC following the 
Lyapunov stability criterion. 

6 Results 

In this section we present results for the closed-loop systems for the type-1 fuzzy 
logic controllers. To perform experiments we use the dynamical model ([Til-©, 
which involves a DC motor linked to a mechanical load through an imperfect con- 
tact gear train [ 1 ], Fig.[5]show the systems testbed. The parameters of the dynamical 
model (TJJ-© are in Tabled while N = 3,j = 0.2 [rad], and K = 5 [N-m/rad]. 

Table 2. Nominal parameters 



Description Notation Value Units 



Motor inertia 


Ji 


2.8 xl(T 6 Kg-m 2 


Load inertia 


Jo 


1.07 Kg-m 2 


Motor viscous 






friction 


fi 


7.6xl0~ 7 N-m-s/rad 


Load viscous 






friction 


fo 


1.73 N-m-s/rad 



The experimental setup involves a DC motor linked to a mechanical load through 
an imperfect contact gear train. Fig. |4| shows the control diagrams, showing the 
locations of sensors, actuators and load. 
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negative 



positive 




-1 1 

domain x 



-1 1 

domain x 





-1 1 

domain x 



-1 1 

domain x 



0.5 



-1 1 

domain x 




-1 1 

domain x 




1 

domain x 



Fig. 2. Set of type-1 membership functions (First row for x\ , second row for x^ and third row 
for u) 




Fig. 3. Physical testbed 



An Observer for the Type-1 Fuzzy Control of a Servomechanism with Backlash 415 






I 



lHHg°-"l8i 



n :■■--• 




Lnwr 



":.-.x..;- 



Loed Fndtitm 



Fig. 4. Control diagram 



With the fuzzy rule-base of TableQ]and the MFs depicted in Fig.|2j we obtain the 
Control Surface of Fig.[5j 




Fig. 5. Control surface for the T1FLC 
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Applying this T1FLC to the proposed problem, we obtain the system's response 
of Fig. [6] that is, qo — > qj while x\ — > 0, this convergence is due to the behavior of 
f\ and xi, this convergence if in Fig. [TJ To emphasize the fact that the system (fT4l) 
observes ©, Fig. [8] shows the behavior of the variables e\ and e'2. Fig. |9] shows the 
behavior of the Lyapunov Function V (fT9b and its derivative V (l2Qb . 






Q. 2 



I / \ 
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I 
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■ motor 
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Fig. 6. System's response for the T1FLC 
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Fig. 7. x\ and x~2 
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Fig. 8. e'i and ei 
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Fig. 9. V and V 
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Table [3] concentrates some measures that can help us to understand the results 
presented in this paper. 

Table 3. T1FLC results 



Settling time Overshoot ISE IAE ITSE ITAE 



15 s 



2.319 63233 19697 2639600 827260 



7 Conclusion 



The main goal of this paper was to design an observer for the mathematical model 
of the case of study in order to propose a systematic methodology to design type-1 
fuzzy logic controllers to solve the output regulation problem of a servomechanism 
with nonlinear backlash. 

We proposed an observer for the system and we did a demonstration of its effec- 
tiveness through a Lyapunov stability test. 

From the observed variables we design, using fuzzy Lyapunov synthesis, the rule 
base of a controller that ensures stability for the closed-loop system, and based on 
the design of the rule base, we identify the granulation necessary to know the num- 
ber and characteristics of the membership functions for each variable involved in 
the fuzzy controller. 
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The proposed design strategy results in two controllers that guarantee that the 
load reaches the desired position q c j, solving the regulation problem as was pre- 
dicted, this affirmation is supported with experiments where both, the type-1 FLC 
designed by following the Fuzzy Lyapunov Synthesis achieve solving the regulation 
problem. 
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Abstract. A neuro-fuzzy learning algorithm is applied to design Takagi-Sugeno 
Fuzzy Controllers for a biped robot walking problem. The appropriate control signal 
is provided with a rule to switch between the controllers. A simulation of generation 
of walking motions is presented to illustrate the effectiveness of this approach. 

1 Introduction 

Fuzzy logic control as an application area of Fuzzy Logic has attracted attention 
from scientist and engineers due that through if-then rules can express knowledge 
of how to control a system incorporating uncertainty. The existence of a big num- 
ber of variables in the measurable variables space (system o external variables) 
and the partition needed in these variables, are factors that have influence in the 
size of the rule base, which implies a nontrivial task into the modeling. In order 
to overcome this difficulties, Clustering, Adaptive Neuronal Networks and Neuro- 
fuzzy approaches have been proposed and applied to identification and control tasks 
where the model of the system to control are not available as in Q; for observer 
design proposed in [6] to control of robot manipulators. In biped robot locomotion 
control, adaptive neural networks and neuro-fuzzy methods are implemented where 
uses the Zero Moment Point (ZMP) [ 12 1 information. For example, in iflOl several 
Adaptive-Network-based Fuzzy Inference System (ANFIS)[7] are proposed to de- 
sign a locomotion controller and another one to determine the model of the system. 
A learning algorithm and a grid partition method are proposed in [ 3 ] to obtain a 
walking controller. Neuro-fuzzy approaches applied in ZMP control trajectories are 
proposed in [4]. 
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This paper is related to the application of ANFIS a design tool of an output 
feedback controller such that it can induce walking motions in a biped robot whose 
model consider the impact with the environment. The control loop uses an output 
definition, which include the movement task, if the output is carried to zero the biped 
robot perform the desired motion. Thus, the controller design is based in the off-line 
learning of a nonlinear relationship between the state space, the outputs and the 
control inputs information which are distributed in multiple instances of the learn- 
ing algorithm which produce a set of Takagi-Sugeno type Fuzzy Logic Controllers 
specialized in a subspace. The final control input is defined as a switching function 
dependant of the time. 

The rest of this paper is organized as follows: Section [2] introduces the mathe- 
matical model of the biped robot. In Section|3j describes some aspects of the neuro- 
fuzzy approach. Section H] explains the control strategy used for controller design. 
Section|5j illustrates simulation results of the closed-loop system. Section|6]provides 
some conclusions. 

2 Biped Robot Model 

In this section the dynamical model of a planar biped robot is introduced. The biped 
robot consists of a torso, hips, and two legs of equal length without ankles and knees. 
Two torques are applied between the legs and torso. The definition of the angular 
coordinates and the disposition of the masses of the torso, hips and legs of the biped 
robot are shown in Fig. Q] the positive angles are computed in a clockwise manner 
with respect to the indicated vertical lines and all masses of the links are lumped. 

The walking cycle takes place in the sagittal plane and on a surface level, and it is 
assumed as successive phases where only one leg (stance leg) touching the walking 
surface (swing phase), with the transition from one leg to another taking place in 
a smaller length of time. During the swing phase, the stance leg is modeled like a 
pivot and the swing leg is assumed to move into the frontal plane [9] and to renter the 
plane of the motion when the angle of the stance leg attains a given desired value. 
The assumptions concerning the walking cycle, define the biped robot model in two 
parts: the model that describes the swing phase and another one that describes the 
contact event of the swing leg with the walking surface; which are presented below. 

2.1 Swing Phase Model 

The dynamical model of the robot during the swing phase is a second order system 
derived from the Lagrange method [ 8 ] : 

M(0)0 + C(0,0)0 + G(0)=fl M , (1) 

where = [0i , 02, 0j,] T are the generalized coordinates, 6\ parameterizes the stance 
leg, 02 the swing leg and 03 the torso; u=\u\, U2\ T are the input; M(0) is a positive- 
definite inertia matrix, C(0, 0)0 is the vector of the centripetal and Coriolis forces; 
and G(0) is the vector of the conservative forces and B is the input matrix. 
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Fig. 1. 3-link biped robot 



the state- space form of the second order equation (Q} is defined as: 



d 

x := — 
dt 



e 

M- 1 (0)[-C(0, 6)9 - G(0) +Bu] 



(2) 



f(x) + g(x)u. 



2.2 Impact Model 

The impact between the swing leg and the ground is modeled as the contact between 
two rigid bodies. The main objective is to obtain the velocity of the generalized 
coordinates after the impact of the swing leg with the walking surface in terms of 
the velocity and position before the impact. The impact model for the biped robot 
is based on the rigid impact model of [5] and assuming the case where the contact 
of the swing leg with the walking surface produce either no rebound nor slipping of 
the swing leg, and the stance leg lifting the walking surface without interaction, the 
impact model is expressed with equation fl3l : 



' M e -E T ~ 

E 




\e e + ' 

F 


= 


~M e e-~ 





(3) 



where e — [$i,02,03,zi,Z2] r are the generalized coordinates resulting of the add 
the cartesian coordinates [zi,Z2] r of the end of the stance leg as shown in Figure [U 
M e is the positive-definite inertia matrix of the biped model with the new generalized 
coordinates; F = [Ft,Fn] t represent the tangential and normal forces applied at the 
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end of the swing leg; 0+ is the velocity after the impact; e is the velocity before 
the impact and 



E:= 



dY_ 

dd e 



rcos(0i) -rcos(0 2 ) 1 
-rsin(0i) /-sin(0 2 ) 1 



(4) 



solving for 0+ of equation (O and a change of coordinates must be done and also 
a re-initialization of the model (O. The change of coordinates is an expression for 
x + := ( + , + ) in terms of x~ := ( _ , ~ ) , which is given for the mapping function 
US: 



x + :=A(x~) 



o 2 - 




0f 




3 - 




e+( x - 


) 


e+(x 


-) 


_e+(x 


-) 



(5) 



Thus, the overall model of biped walking is the combination of the swing phase 
model and impact model, as follows: 



3d 



x = f(x) + g(x)u(x) x £ S 



:A(X~) 



x- <ES 



where 



S:={(9,G)\zi >o,z 2 = o,e 1 = ef} 



(6) 

(7) 



3 ANFIS and Fuzzy Models 

ANFIS [7] is a class of adaptive neural network whose functionally is equivalent to 
the T-S fuzzy reasoning mechanism ifTTI . The ANFIS architecture consist on five 
layers and the node function for each node of same layer corresponds to the same 
family functions. The architecture is detailed as follows: 

3.1 Layer 1 

Every node i in this layer is an adaptive node with a node function 

0]=PLa,{x) (8) 

where x is the input to node i and A,- is the linguistic label associated with this 
node, 0\ is the membership function of the fuzzy set A,, it specifies the grade that 
the given value x satisfies the quantifier A. The membership function /^(x) can be 
parameterized for 



MA,-(*)=exp{--(— -) 2 } 
2 a t 

where a, and c,- are referred as premise parameters. 



(9) 
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3.2 Layer 2 

Every node i in this layer is a fixed node labeled 17, whose output is the product 
of all the incoming signals. For example, consider two linguistic variables A and B 
characterized by two fuzzy sets each. Then the output of a node i of this layer is 
obtained by: 

0} = wi = Ma, (x)Hb, GO , i = 1,2. (10) 

each node represent the firing strength of a rule. 

3.3 Layer 3 

Every node in this layer is a fixed node labeled N. The ith node calculate the ratio of 
the fth rule's firing strength to the sum of the all rules' firing strengths. The outputs 
of this layer are called normalized firing strengths (vPi). 

0?=*i = 5$_,» = l,2. (11) 

3.4 Layer 4 

Every node i in this layer is an adaptive node with node function 

Of = mfi = WitpiX+qty + n), i= 1,2. (12) 

where /?,, qi, rj is the parameter set of this node, this are the consequent parameters. 

3.5 Layer 5 

The single node in this layer is a fixed node labeled E, which computes the overall 
output as the summation of all incoming signals: 

Of=I,w,/, = ^f (13) 

The learning rule combines gradient method and least squares estimate to identify 
and update the parameters of the network, these parameters are the premise and 
consequent parameters which describe a Fuzzy Model. 

Thus, ANFIS can serve as basis for constructing a set of T-S type fuzzy model 
ifTTI with appropriate membership functions starting from a set of input-output data 
pairs. Consider a T-S fuzzy model with input variables x\, #2,..., Xj and output 
variables u\, U2,...,ui l , the /th rule takes the form: 

R l : IF x\ is An and X2 is A%\ and ... Xj is A ; -.,- 
THEN 

u\ =G(xi,X2,.-,Xj;piA,pi2,--,pij,pij+i) (14) 

and ... and 

Uk = G(xi,X2,~;Xj\ Pk. 1 , Pk.2 j — i Pk.j , Pk,j+ 1 ) 

where Aj j with i = 1,2,..., mi and j = 1,2, .../M2, are the /th fuzzy set of the 7th 
input variable; Uk with k = 1,2, ...,mj,, each G(-) is an crisp continuous function in 
terms of the inputs with j + 1 parameters p. 
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4 Control Strategy 

This paper uses the output form proposed in [ 13 ], which encode a geometric task for 
the robot, identifies the zero dynamics of the system model including the presence 
of impacts and design a feedback controller that generate exponential stable walking 
motions. The output is defined as follows: 



hi(6,a) 
h 2 (0,a) 



03-hd,i(0i,a) 
dz-h dj2 (9i,a) 



where 



h dil (e u a):=a° 1 + --- + al(8 l ) 



hd,i{Qi,a) 



-0i + (4 + ■ ■ ■ +al(e 1 y) x (0j + of) x (0i - ef ). 



(15) 



(16) 



The control objective is to find a intelligent control law taking the trajectories of the 
closed-loop system during a walking cycle obtained in |fT3ll such that the output is 
driving to zero. 



4.1 Controller Design 

This paper uses ANFIS for the T-S Fuzzy Logic Controller (T-S FLC) design. In 
order to accomplish the control objective, the state trajectories x= [0i,02,03,0i, 
02, 03] T are considered, the output trajectories y — \yi,y 2 ] and the control inputs 
u = \u\, u 2 ] T corresponding to closed-loop system simulation of the work proposed 
in |[T3ll during a walking cycle, to built the data set (input-output data pairs). 

The data set is partitioned to diminish the computational load and the time to the 
algorithm, and then distributed to different instances of the learning algorithm as 
follows: 2 ANFIS take each gth partition and use as input data the state and output 
trajectories; as output data, one of the control inputs. The /th rule of each T-S FLC 
has the form as: 

R 1 : IF 0i is A\ ,■ and 02 is A 2 j and 03 is A3 ,■ and 
01 is A4 j and 02 is A5 ,■ and 03 is A(, ,- and 

V] is Ajj and y 2 is A & j 



THEN 



(17) 



Uk = P1.161 + 1)1,262 + P13O3 + PiaOi 

Pifth + pifify + pijyi +pi,m + Pi, 



where Aj j with i = 1,2 are fuzzy sets for each input variable: Q\, 02, 03, 0i, 02, 
03, y\ and V2 respectively; u^ with k = 1,2 is one control input. The fuzzy sets are 
parameterized as in ©. 

Thus, the overall controller consist of four controllers, each one consists of two 
first order T-S FLC, and each T-S FLC provides one input control. The overall 
scheme of proposed controller is presented in Fig. 12 The closed-loop system is 
shown in Fig. [5J the signal defined as ho(0) are the quantities to control, which 
represent to the first term of each element of the vector defined in ( fT5l) . 
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Fig. 2. Controller Structure 




Fig. 3. Closed-loop system block diagram 
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The control input u = [mj,^] 7 is defined through this function: 



' „1 _ E,w,G,(.c,y;p) 

Ui ~~ —————— 



u(x,y,t) := < 



1 - 


£i »7 


1 


X;tf;Gj(*,y;p) 


2 


£; Wj 


? 


£,»',G,(:t,y;p) 


1 


x,- »•,■ 


2 


S;WjG;(j£,;y;p) 


2 


S,»v 


3 


Y.iWjGj(x,y;p) 


1 


Ei wj 


3 


IiWjG;(x,y;p) 


2 


X,»v 


4 


£,w>,G,- (*,)>;/)) 


1 


Si »•,■ 


4 _ 


I.iW i G i (x,y;p) 



,0<t <t\ 



,t\ <t <t 2 



,t 2 <t <H 



,h<t<U 



(18) 



Zd *i 



where x = [6i , 62, 63, 6\ , 62, 9j,}, y = [yi,^], » = 1,2, ..., Z c , l c is the number of rules 
in each controller; u c k denotes the &th control input provided by cth controller and 
represents the weight average of rules activation within antecedents projected on the 
consequents through a time interval; w, is the firing strength of each rule as in dTob : 
G(x,y,p) is the consequent of the each rule which takes the form as in dT2b and p 
are the consequents parameters of each rule provided by the learning algorithm. The 
total rule base of the controller comprises 2048 if-then rules, considering that the 
universe of discourse of each input variable is represented through two membership 
functions. The parameters which describe each T-S FLC are provided in Section[6] 

5 Simulation Results 



The numerical verification of the closed-loop system was performed in MatLab, in 
a Pentium IV computer with 512 MB of RAM, using the software implementation 
of the biped robot available in [1|. The physical parameters of the biped model 
are shown in Table [U and the impact occurs when Of = tt/8. The time values that 
define the interval in dT8b are shown in Table [3j The parameters of the output ( TT3b - 
(TT~6b are given in Table [2j The matrices of the equations ©-(fJI which describe the 
mathematical model of the biped robot are given at Section[6l 

In Fig. @] to Q show the state trajectories, output trajectories and input control of 
closed-loop simulation where the biped robot to perform one step. 
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Table 1. Biped model parameters 



Parameter 



Mass of Hips (M//) 

Mass of Torso (Mr) 

Mass of Legs (m) 

Length of legs (r) 

Length of torso (/) 



Value 



15 kg 
10 kg 
5 kg 
lm 
0.5 m 



Table 2. Parameters of the output definition y in d 1 6b 



d> 


a\ 


aj 


a\ 


4 


a\ 


a 2 


ai. 


0.512 


0.0730 


0.0350 


-0.8190 


-2.2700 3.2600 3.1100 1.8900 



Table 3. Time values of the control law definition in ill St 



h 


h 


h 


k 


0.098 


0.383 


0.980 


1.121 




0.2 0.4 0.6 0.0 1 1.2 1.4 



=3- 0.5 

of O 

-0.5 

0.6 

=3- 0.55 

ctS° 0.5 

0.45 




0.2 0.4 0.6 0.8 1 1.2 1.4 






0.2 0.4 0.6 0.8 1 1.2 1.4 

i f. time (sec) x f 

12 3 4 



Fig. 4. Joint position trajectories 
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Fig. 5. Joint velocities trajectories 
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Fig. 6. Output trajectories 



Neuro-Fuzzy Based Output Feedback Controller Design for Biped Robot Walking 433 



-20 



-40 



-BO 






0.2 0.4 0.6 0.8 1 1.2 1.4 




0.2 0.4 0.6 0.8 1 1.2 1.4 



t t time (sec) t t 



Fig. 7. Control signals 



6 Conclusions 



In this paper a T-S Fuzzy Logic Controller design to generate walking motions is 
presented. The control strategy consider an output function imposed in the feedback 
and several ANFIS are used to adapt the parameters of T-S FLCs such that fit a set 
of input-output data pairs which represent the evolution of the state variables and 
the output function during a walking cycle. An intelligent control law as switching 
function is proposed whose objective is to drive the quantities to control ho(0) to 
track the output function. The simulations results demonstrate that this method can 
be applied to solve the problem, its effectiveness is related to the capabilities to per- 
form the data fitting and at this time, the controller is ad hoc for the output function 
and biped mathematical model parameters presented within the paper. 
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Appendix 1 

Model Details 

This section completes the equations of the biped model (0 - (O. In the following, 
this is the notation used: 

^i i = sin(0 1 -0 ; ),./e{2,3} 
ci_ / = cos(6i-0/) ) je{2,3} 

Swing phase Model 



M 



(^m + M H + M T )r 2 -\mr 2 c\2 M T rlc\^ 
— ^mr~c\2 \mr 

M T rlc i3 M T l 2 



C 



\mr 2 snQ\ 
-Mrrlstfdi 















\g{2M H + 3m + 2M T )r sin(0i ) 
jgmrsin(62) 
— gM T l sin (0 3 ) 

-10 

B= -1 

1 1 
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Impact Model 

The inertia matrix M e has the entries: 



M l e l = { 5 -m+M H + M T )r 2 

M 12 = --mr 2 c 12 

M l e 3 = M T rlcn 

3 
M l A = (=m + M H +M T )r cos(0i) 

3 
Ml 5 = -(-m + M H +M T )rsm(0i) 



M 22 


1 2 

= —mr 
4 


Mf 


= — —mrcos(Q2) 


M 25 


= -mrsin(02) 


Mf 


= M T l 2 


Mf 


= M r /cos(03) 


Mf 


= — M^/sinf^) 



M^ = 2m + M H + M T 
Mg 5 = 2m + M H +M T 



Appendix 2 

T-S Fuzzy Controllers Parameters 

Antecedent Parameters 

This subsection Table s|4] to [7] show the membership functions parameters of each of 
the controller explained in the section 0] 
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Table 4. Membership function parameters of first Controller 



Input 


Fuzzy 
Set 


T-S FLC 1 
Parameters 


T-S FLC 2 
Parameters 




a 


c 


a 


c 


0i 


M.2 


0.04179 
0.01850 


-0.38862 
-0.28397 


0.04136 
0.01845 


-0.39037 
-0.28763 


e 2 


A 2,l 

A 2 .2 


0.02619 
0.00797 


0.39265 
0.45614 


0.01917 
0.00766 


0.39171 
0.45421 


e 3 


A 3,l 

A 3 .2 


0.00331 
0.03138 


0.47294 
0.53158 


0.02037 
-0.00288 


0.49516 
0.55305 


0i 


Al.2 


0.02612 
0.04841 


0.90943 
0.97931 


0.02432 
0.04458 


0.92174 
0.98509 


02 


A 2 ,i 

A 2 ,2 


0.43232 
0.43512 


-0.24231 
0.79588 


0.43764 
0.43810 


-0.23986 
0.79484 


03 


^3,1 

^3,2 


0.67073 
0.67316 


-0.10894 

1.48427 


0.67329 
0.67532 


-0.10776 
1.48346 


y\ 


A 4 ,i 

M.2 


0.01973 
0.01592 


-0.04125 
0.03541 


0.03504 
0.01244 


-0.03530 
0.04119 


yi 


A 5,l 

^5.2 


0.03453 
0.01242 


-0.03573 
0.02543 


-0.00024 
0.02626 


-0.06683 
-0.00515 



Table 5. Membership function parameters of second Controller 



Input 


Fuzzy 
Set 


T-S FLC 1 
Parameters 


T-S FLC 2 
Parameters 




a 


c 


a 


c 


0i 


A U 

M,2 


0.11912 
0.04469 


-0.26805 
-0.05091 


0.04926 
0.07628 


-0.31383 
-0.10665 


02 


A 2 ,l 

A2,2 


0.00209 
0.03183 


0.43179 
0.49398 


0.02466 
0.02314 


0.43144 
0.49563 


03 


A 3,l 

^3,2 


0.01710 
0.00148 


0.51069 
0.55181 


0.01704 
-0.00160 


0.50994 
0.54978 


0i 


Ai,i 

Al,2 


0.15245 
0.19932 


0.45236 
0.92833 


0.19035 
0.18665 


0.47829 
0.94280 


02 


A 2 ,l 

A 2 ,2 


0.46471 
0.46829 


-0.36901 
0.76003 


0.47436 
0.47100 


-0.36670 
0.75723 


03 


A 3,l 

A3,2 


0.04614 
0.08043 


-0.23211 
-0.01404 


0.09284 
0.07787 


-0.19973 
0.00310 


y\ 


A 4 ,l 
A 4 ,2 


-0.00075 
0.01384 


-0.00595 
0.02182 


0.01248 
-0.00078 


0.00329 
0.03306 


yi 


A 5 ,l 

A 5 ,2 


0.01796 
0.00909 


-0.08025 
-0.02957 


0.01307 
0.01867 


-0.06672 
-0.00895 
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Table 6. Membership function parameters of third Controller 



Input 


Fuzzy 
Set 


T-S FLC 1 
Parameters 


T-S FLC 2 
Parameters 




a 


c 


a 


c 


0t 


M,2 


0.13746 
0.13887 


-0.10592 
0.22189 


0.13768 
0.14065 


-0.10662 
0.22035 


e 2 


A 2,l 

^2,2 


0.23348 
0.23356 


-0.07858 
0.47241 


0.23411 
0.23311 


-0.07802 
0.47289 


3 


A 3 ,2 


0.00812 
0.01100 


0.50028 
0.51772 


0.00461 
0.01351 


0.49942 
0.51900 


0i 


Ai.i 
Al,2 


0.22045 
0.21908 


0.42482 
0.94333 


0.21962 
0.21921 


0.42427 
0.94308 


e 2 


A 2 ,i 

^2,2 


0.69502 
0.69512 


-2.00380 
-0.36665 


0.69503 
0.69490 


-2.00372 
-0.36648 


e 3 


^3,1 

A3.2 


0.02408 
0.03000 


-0.03254 
0.03532 


0.02631 
0.02974 


-0.03178 
0.03535 


y\ 


M,2 


0.00061 
-0.00696 


0.00626 
0.00520 


-0.00204 
-0.00870 


0.01551 
0.00277 


yi 


A 5 ,i 

A 5 ,2 


0.00253 
0.00774 


-0.01839 
0.00191 


-0.00363 
-6.53 x 10~ 5 


-0.00373 
0.00076 



Table 7. Membership function parameters of fourth Controller 



Input 


Fuzzy 
Set 


T-S FLC 1 
Parameters 


T-S FLC 2 
Parameters 




a 


c 


a 


c 


6i 


Ai.i 

M,2 


0.07507 
0.07136 


0.22460 
0.39446 


0.08552 
0.03315 


0.23442 
0.41950 


02 


A 2,l 

^2,2 


0.13168 
0.12964 


-0.39218 
-0.07554 


0.12745 
0.12509 


-0.39340 
-0.07315 


03 


A 3j i 

A 3 ,2 


-0.01008 
0.01417 


0.48029 
0.51326 


0.00837 
0.00808 


0.48193 
0.50781 


0i 


A M 

Al,2 


0.25863 
0.25689 


0.94355 
1.55087 


0.25958 
0.25120 


0.94432 
1.55362 


02 


A 2,l 

M,2 


0.14937 
0.14704 


-2.34638 
-2.00387 


0.15849 
0.14260 


-2.34355 
-2.00340 


03 


A 3,l 

^3,2 


0.16569 
0.17196 


-0.43464 
-0.03286 


0.14283 
0.17654 


-0.44816 
-0.03871 


y\ 


A4.2 


-0.00826 
-0.00034 


-0.00474 
-0.00333 


-0.00815 
-0.00145 


-0.00371 
-0.00814 


yi 


Asa 

M,2 


-0.00932 
-0.00032 


0.00284 
0.00309 


0.00288 
0.00189 


0.00011 
0.01339 



Consequent Parameters 

This subsection presents the consequent parameters of the controller explained in the 
section|4j In Tables[8lto[T3lprovide the consequent parameters of 1 5 1 rules of the first 
controller. 
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Table 8. Consequent parameters of the Controller 1 - T-S FLC 1 (1 of 3) 



R> 


Pl.i 


Pl.2 


Pi, 3 


Pl.4 


Pl.5 


R l 


-2.36 x 10" lu 


2.36 x 10" 1U 


2.98 x 10" 1U 


5.55 x 


10 -io 


-1.48xlO- lu 


R- 


-5.26 x 1CT 11 


5.26 x 10"" 


6.65 x lO" 11 


1.24 x 


io-'° 


-3.29x10-" 


R 3 


-1.55 x IO" 15 


1.55 x IO" 15 


1.95 x 10 l5 


3.64 x 


io- 15 


-9.93 xlO- 16 


R 4 


-3.60 x 1CT 16 


3.60 x IO" 16 


4.54 x IO" 16 


8.47 x 


io-' 6 


-2.28 xlO- 16 


R> 


-4.06 x IO" 09 


4.07 x IO" 09 


5.13 x IO -09 


9.57 x 


10 -09 


-2.54xl0-° 9 


R b 


-9.00 x 10~ 10 


9.00 x IO -10 


1.14 x IO" 09 


2.12 x 


io- 09 


-5.61 x IO" 10 


R 1 


-2.71 x IO" 14 


2.71 x IO" 14 


3.42 x IO" 14 


6.38 x 


io- 14 


-1.73 xlO" 14 


R« 


-6.23 x 10" 15 


6.23 x IO" 15 


7.87 x IO" 15 


1.47 x 


io- 14 


-3.93 x IO" 15 


R 9 


-1.33 x 10"" 


1.33 x lO" 11 


1.68 x 10"" 


3.13 x 


io-" 


-8.40 xlO" 12 


R W 


-2.30 x IO" 12 


3.00 x IO" 12 


3.79 x IO" 12 


7.07 x 


IO" 12 


-1.88 xlO" 12 


R n 


-8.42 x 10"" 


8.42 x 10"" 


1.06 x IO" 16 


1.98 x 


io" 16 


-5.47xlO- 17 


R 11 


-2.00 x 10"" 


2.00 x 10"" 


2.53 x 10"" 


4.72 x 


io- 17 


-1.28X10" 17 


R 13 


-2.30 x IO" 10 


2.30 x IO -10 


2.90 x IO" 10 


5.41 x 


io-'" 


-1.45 xlO" 10 


R u 


-5.15 x lO" 11 


5.15 x KT 11 


6.51 x 10"" 


1.21 x 


10 -io 


-3.22x10"" 


R 15 


-1.49 x IO" 15 


1.49 x IO" 15 


1.88 x IO" 15 


3.51 x 


IO" 15 


-9.61 x IO" 16 


R 16 


-3.49 x IO" 16 


3.49 x IO" 16 


4.41 x IO" 16 


8.23 x 


10 -16 


-2.22 xlO" 16 


R" 


-1.57 x lO -10 


1.57 x IO -10 


1.98 x IO" 10 


3.69 x 


io-'" 


-9.91 x IO" 11 


R 1S 


-3.54 x 10"" 


3.54 x 10"" 


4.47 x 10"" 


8.34 x 


io-" 


-2.22x10"" 


R 19 


-9.98 x IO" 16 


9.97 x IO" 16 


1.26 x IO" 15 


2.35 x 


IO" 15 


-6.48 xlO" 16 


R 20 


-2.37 x IO" 16 


2.37 x IO" 16 


2.99 x IO" 16 


5.58 x 


io- 16 


-1.51 xlO" 16 


R 21 


-2.71 x IO" 09 


2.71 x IO" 09 


3.42 x IO -09 


6.39 x 


IO" 09 


-1.71 x IO" 09 


R 22 


-6.07 x IO" 10 


6.07 x IO -10 


7.67 x IO -10 


1.43 x 


io- 09 


-3.79 xlO" 10 


R 23 


-1.76 x IO" 14 


1.76 x IO" 14 


2.23 x IO" 14 


4.15 x 


io- 14 


-1.14xlO" 14 


R 24 


-4.13 x IO" 15 


4.12 x IO -15 


5.21 x IO" 15 


9.72 x 


IO" 15 


-2.62 xlO" 15 


R 25 


-8.75 x IO" 12 


8.74 x IO" 12 


1.10 x lO" 11 


2.06 x 


io-" 


-5.57 xlO" 12 


R 26 


-2.00 x IO" 12 


2.00 x IO" 12 


2.53 x IO" 12 


4.72 x 


io- 12 


-1.26 xlO" 12 


R 21 


-5.35 x 10"" 


5.34 x 10"" 


6.74 x 10"" 


1.26 x 


10 -16 


-3.52x10"" 


R 2S 


-1.30x 10"" 


1.30 x 10"" 


1.65 x 10"" 


3.07 x 


io- 17 


-8.40 xlO" 18 


R 29 


-1.52 x IO" 10 


1.52 x IO -10 


1.92 x IO" 10 


3.59 x 


io- 10 


-9.65 x 10"" 


R 30 


-3.45 x 10"" 


3.45 x 10"" 


4.36 x lO" 11 


8.14 x 


io-" 


-2.17x10"" 


R 31 


-9.57 x IO" 16 


9.56 x IO" 16 


1.21 x IO" 15 


2.25 x 


IO" 15 


-6.24 xlO" 16 


R 32 


-2.29 x IO" 16 


2.29 x IO" 16 


2.89 x IO" 16 


5.40 x 


10 -16 


-1.47 xlO" 16 


R 33 


3.01 x IO" 01 


-3.04 x IO" 01 


-3.89 x IO" 01 


-7.27 x 


IO" 01 


9.32 x IO" 02 


R 34 


1.65 x IO" 01 


-1.63 x IO" 01 


-2.02 x IO" 01 


-3.75 x 


IO" 01 


1.82 x IO" 01 


R 3S 


7.07 x IO" 02 


-9.53 x IO" 02 


-1.20 x IO" 01 


-2.14 x 


IO" 01 


-1.65 x IO" 01 


R 36 


9.21 x IO" 07 


-8.66 x IO" 07 


-1.02 x IO" 06 


-1.88 x 


10 -06 


2.24 x IO" 06 


R 31 


4.66 x IO" 00 


-4.66 x 10-"° 


-5.80 x IO" 00 


-10.8 x 


IO" 00 


4.25 x IO" 00 


R 3S 


4.42 x IO" 00 


-4.40 x 10-"° 


-5.52 x lO" 00 


-10.3 x 


IO" 00 


3.57 x IO" 00 


R 39 


1.38 x IO" 02 


-1.66 x IO" 02 


-2.14 x IO" 02 


-3.92 x 


io- 02 


-2.38 x IO" 02 


K 40 


1.89 x IO" 05 


-1.91 x IO" 05 


-2.50 x IO" 05 


-4.70 x 


io- 05 


-2.90 x IO" 06 


R 41 


9.66 x IO" 02 


-1.06 x IO" 01 


-1.39 x IO" 01 


-2.60 x 


IO" 01 


-1.11 x IO" 01 


R 41 


8.36 xlO" 04 


-8.38 x IO" 04 


-9.49 x IO" 04 


-1.71 x 


io- 03 


2.20 x IO" 03 


R 43 


9.81 x IO" 01 


-1.32 x 10-"° 


-1.66 x IO" 00 


-2.98 x 


IO" 00 


-2.31 x IO" 00 


R 44 


1.55 x IO" 07 


-2.21 x IO" 07 


-3.18 x IO" 07 


-5.99 x 


io- 07 


-1.15 x IO" 06 


R 45 


3.71 x IO" 01 


-3.77 x IO" 01 


-4.81 x IO" 01 


-8.99 x 


IO" 01 


9.85 x IO" 01 


g*S 


9.93 x IO" 02 


-9.79 x IO" 02 


-1.19 x IO" 01 


-2.21 x 


IO" 01 


1.42 x IO" 01 


R 41 


1.27 x IO" 01 


-1.64 x IO" 01 


-2.09 x IO" 01 


-3.79 x 


IO" 01 


-2.87 x IO" 01 


R 48 


-1.17 x IO" 06 


1.24 x IO" 06 


1.77 x IO" 06 


3.41 x 


io- 06 


3.24 x IO" 06 


R 49 


2.46 x lO -00 


-2.62 x lO" 00 


-3.46 x lO -00 


-6.49 x 


IO" 00 


-2.28 x IO" 00 


R so 


3.98 x IO" 02 


-3.98 x IO" 02 


-4.93 x IO" 02 


-9.14 x 


io- 02 


4.05 x IO" 02 


R 51 


2.25 x IO" 00 


-2.65 x IO" 00 


-3.45 x IO" 00 


-6.36 x 


io-°" 


-3.84 x IO" 00 
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Table 9. Consequent parameters of the Controller 1 - T-S FLC 1 (cont. 1 of 3) 



R> 


Pl.6 


Pl.7 


Pi, 8 


Pi, 9 


R 1 


8.95 x 10" IU 


-2.53x10"" 


2.17 x 10~ 13 


6.00 xlO" 1 " 


A 2 


2.00 x 1(T 10 


-5.65 x 10~ 12 


3.65 x 10~ 14 


1.34 x 10" 10 


A 3 


5.89 x 10~ 15 


-1.67 xlO" 16 


2.40 x 10~ 18 


3.93 x 10" 15 


A 4 


1.37 x 10~ 15 


-3.88 xlO" 17 


4.06 x 10~ 19 


9.15 xlO -16 


A 5 


1.54 x lO" 08 


-4.36 xlO" 10 


3.35 x 10~ 12 


1.03 xlO~ 08 


A 6 


3.41 x 10-" 9 


-9.66 xlO~" 


5.62 x 10~ 13 


2.29 x 10~ 09 


R 1 


1.03 x 10~ 13 


-2.93 x 10~ 15 


3.70 x 10~ 17 


6.90 xlO" 14 


A 8 


2.37 x 10~ 14 


-6.71 x 10~ 16 


6.26 x 10~ 18 


1.59 x 10~ 14 


R 9 


5.04 x 10" 11 


-1.43 xlO" 12 


1.46 x 10~ 14 


3.38X10" 11 


A' 10 


1.14 X 10" 11 


-3.22 xl0~ 13 


2.47 x 10~ 15 


7.63 x 10~ 12 


A" 


3.22 x 10~ 16 


-9.15 xlO~ 18 


1.60 x 10~ 19 


2.14 x 10" 16 


A 12 


7.63 x 10~ 17 


-2.16 xlO~ 18 


2.73 x lO" 20 


5.10x 10~ 17 


A 1 - 1 


8.72 x lO" 10 


-2.47x10-" 


2.25 x 10~ 13 


5.85 xlO" 10 


A 14 


1.95 x 10" 10 


-5.53 xlO" 12 


3.79 x 10" 14 


1.31 xlO -10 


A 15 


5.68 x 10~ 15 


-1.62 xl0~ 16 


2.49 x 10~ 18 


3.79 x 10~ 15 


A 1 " 


1.33 x 10~ 15 


-3.77 xl0~ 17 


4.21 x 10~ 19 


8.89 x 10~ 16 


A 17 


5.95 x 10 -10 


-1.69 x 10 -11 


1.70 x 10~ 13 


3.99 x 10- 10 


A 18 


1.34 x 10- 10 


-3.80 xlO- 12 


2.86 x 10~ 14 


9.00x10"" 


A 19 


3.81 x 10~ 15 


-1.08 xl0~ 16 


1.86 x 10~ 18 


2.54 x 10~ 15 


A 20 


9.01 x 10~ 16 


-2.58 x 10~ 17 


3.17 x 10~ 19 


6.03 x 10" 16 


A- 1 


1.03 x 10-" 8 


-2.92 x lO" 10 


2.61 x 10~ 12 


6.90 x 10-° 9 


A 22 


2.30 x 10-" 9 


-6.52x10-" 


4.40 x 10~ 13 


1.55 x 10- 09 


A 23 


6.72 x 10~ 14 


-1.91 xl0~ 15 


2.88 x 10~ 17 


4.49 x 10~ 14 


A- 4 


1.57 x 10~ 14 


-4.45 x 10~ 16 


4.89 x 10~ 18 


1.05 x 10~ 14 


A- 5 


3.33 x 10-" 


-9.44 xl0~ 13 


1.14x 10~ 14 


2.23 x 10"" 


A-" 


7.61 x 10~ 12 


-2.16 xl0~ 13 


1.93 x 10~ 15 


5.10x 10~ 12 


A 27 


2.05 x 10~ 16 


-5.83 xlO^ 18 


1.24 x 10~ 19 


1.36 xlO -16 


A- 8 


4.97 x 10~ 17 


-1.41 x 10~ 18 


2.12 x lO" 20 


3.32 x 10~ 17 


A 29 


5.79 x 1(T 10 


-1.64x10-" 


1.76 x 10~ 13 


3.88 x lO" 10 


A 10 


1.31 x 1(T 10 


-3.71 xl0~ 12 


2.97 x 10~ 14 


8.79 x 10"" 


A 11 


3.66 x 10~ 15 


-1.04 xlO" 16 


1.92 x 10~ 18 


2.44 x 10~ 15 


A 52 


8.73 x 10- 16 


-2.48 xlO" 17 


3.28 x 10~ 19 


5.83 x 10 16 


A" 


-1.05 x 10 -00 


2.76 x 10-° 2 


4.29 x 10" 03 


-7.75 x 10 -01 


A 14 


-6.98 x lO" 01 


2.15 x 10" 02 


-3.83 x lO" 03 


-4.14 x 10 -01 


R 35 


-1.65 x lO" 02 


-4.40 x 10-° 3 


1.05 x 10" 02 


-2.22 x 10" 01 


R 36 


-5.10 x lO" 06 


1.82 xlO -07 


-8.01 x 10~ 08 


-2.21 x 10" 06 


A 17 


-18.8 x 10-"" 


5.64 xlO" 01 


-6.52 x 10" 02 


-11.8 xlO" 00 


A 18 


-17.5 x 10-"° 


5.15 xlO" 01 


-4.14 x 10" 02 


-11.2 x 10" 00 


R 39 


-1.60 x 10~ 02 


-2.96 x lO" 04 


1.59 x 10" 03 


-4.01 x lO" 02 


A 40 


-5.86 x 10-" 5 


1.32 xlO" 06 


6.74 x 10" 07 


-4.90 x 10" 05 


A 41 


-1.90 x lO" 01 


1.46 x lO" 03 


8.20 x 10" 03 


-2.66 x 10 -01 


A 42 


-4.61 x 10-" 3 


1.69 x lO" 04 


-7.82 x 10" 05 


-2.06 x lO" 03 


R« 


-1.98 x lO" 01 


-6.28 x 10" 02 


1.47 x 10" 01 


-3.08 x 10" 00 


A 44 


6.88 x 10-" 7 


-4.82 x 10-° 8 


6.03 x 10" 08 


-5.31 x 10" 07 


A 45 


-1.28 x 10-"" 


3.31 x 10" 02 


6.10 x 10" 03 


-9.57 x 10-°' 


A 46 


-4.50 x 10~ 01 


1.46 x 10- 02 


-3.82 x lO" 03 


-2.48 x 10-°' 


A 47 


-5.58 x lO" 02 


-6.95 x lO" 03 


1.83 x 10" 02 


-3.88 x 10" 01 


A 48 


8.12 x lO" 07 


6.76 xlO" 08 


-1.86 x 10~ 07 


3.23 x lO" 06 


A 41 ' 


-5.57 x 10-"° 


7.03 x lO" 02 


1.81 x 10" 01 


-6.63 x 10" 00 


A 50 


-1.65 x 10- ()1 


5.01 x 10-° 3 


-7.52 x 10" 04 


-1.01 xlO -01 


A 51 


-2.81 x 10" 00 


-4.16 xlO -02 


2.56 x 10" 01 


-6.48 x 10-°° 
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Table 10. Consequent parameters of the Controller 1 - T-S FLC 1 (2 of 3) 



ri 


Pl.l 


Pi. 2 


Pl.3 


Pl,4 


Pl.5 


r si 


2.05 x 1CT 05 


-2.16x 10-° 5 


-2.89 x 


-05 


-5.45 x 10- 05 


-2.30 x 10-° 5 


R 53 


5.85 x 10-°° 


-6.08 x 10-°° 


-7.87 x 


o- 00 


-14.7 x 10-°° 


-1.60 x 10-°° 


R 54 


1.44 x 10" 00 


-1.43 x 10-°° 


-1.76 x 


o- 00 


-3.27 x 10-°° 


1.62 x 10-°° 


R 55 


1.05 x 10" 00 


-1.20 x lO -00 


-1.58 x 


o- 00 


-2.95 x 10-°° 


-1.83 x lO" 00 


R 56 


-2.44 x 10-° 6 


3.36 x 10-° 7 


-2.68 x 


-08 


2.09 x 10-° 7 


-2.93 x 10-° 5 


R 57 


5.32 x irr 00 


-5.98 x lO -00 


-7.88 x 


O-oo 


-14.7 x 10-°° 


-8.07 x lO -00 


R 58 


3.09 x 10-° 3 


-3.07 x lO" 03 


-3.91 x 


Q-03 


-7.31 x 10-° 3 


1.66 x 10-° 3 


R 59 


4.43 x 10 -00 


-7.27 x 10-°° 


-7.81 x 


O-oo 


-12.49 x lO -00 


1.33 x 10-°° 


R 60 


1.48 x 10-° 5 


-1.60x lO" 05 


-2.06 x 


0-05 


-3.83 x 10-° 5 


-7.89 x 10-° 6 


R 61 


6.12 x irr 00 


-6.52 x 10-°° 


-8.58 x 


O-oo 


-16.1 x 10-°° 


-5.52 x lO -00 


R 62 


3.38 x 10-° 2 


-3.41 x lO" 02 


-4.34 x 


Q-02 


-8.10 x 10-° 2 


1.42 x 10-° 2 


R« 


6.65 x irr 00 


-7.32 x 10 -00 


-9.77 x 


O-oo 


-18.3 x 10-°° 


-10.4 x lO" 00 


R 64 


5.77 x 10-° 5 


-5.97 x lO" 05 


-7.90 x 


Q-05 


-1.49 x 10-° 4 


-3.97 x 10-° 5 


ff 65 


-4.19 x 10~ 24 


4.18 x lO" 24 


5.29 x 


Q-24 


9.87 x 10- 24 


-2.62 x 10- 24 


ff 66 


-9.24 x 10- 25 


9.24 x lO" 25 


1.17x 


Q-24 


2.18 x 10- 24 


-5.74 x 10- 25 


R 67 


-2.83 x 10- 29 


2.83 x 10- 29 


3.57 x 


Q-29 


6.67 x 10- 29 


-1.80 x 10- 29 


ff 68 


-6.45 x 10- 30 


6.45 x 10- 30 


8.15 x 


Q-30 


1.52 x 10- 29 


-4.06 x 10- 30 


R 69 


-7.17 x 10" 23 


7.17 x lO" 23 


9.06 x 


Q-23 


1.69 x lO" 22 


-4.47 x lO" 23 


R 10 


-1.58 xlO- 23 


1.58 x lO" 23 


1.99 x 


Q-23 


3.72 x lO" 23 


-9.78 x 10- 24 


R 71 


-4.92 x 10~ 28 


4.92 x lO" 28 


6.21 x 


Q-28 


1.16 x lO" 27 


-3.11 x 10- 28 


R 12 


-1.11 x 10- 28 


1.11 x lO" 28 


1.40 x 


Q-28 


2.62 x 10- 28 


-6.96 x 10- 29 


R 73 


-2.38 x 10- 25 


2.38 x lO" 25 


3.00 x 


o- 25 


5.60 x 10- 25 


-1.49x 10- 25 


R 14 


-5.30 x 10- 26 


5.30 x lO" 26 


6.70 x 


Q-26 


1.23 x 10- 25 


-3.31 x lO" 26 


R 75 


-1.57 xlO- 30 


1.57 x 10- 30 


1.98 x 


q-30 


3.69 x 10- 30 


-1.01 x 10- 30 


R 76 


-3.64 x 10- 31 


3.63 xlO" 31 


4.59 x 


o- 31 


8.57 x lO" 31 


-2.30 x 10- 31 


R 11 


-4.09 x 10- 24 


4.09 x 10- 24 


5.17x 


q-24 


9.64 x lO" 24 


-2.56 x 10- 24 


R 78 


-9.06 x 10- 25 


9.06 xlO" 25 


1.14x 


q-24 


2.14 x 10- 24 


-5.64 x lO" 25 


R 19 


-2.74 x 10~ 29 


2.74 x 10- 29 


3.46 x 


0-29 


6.46 x 10- 29 


-1.75 x 10- 29 


R w 


-6.29 x 10- 30 


6.29 x 10- 30 


7.94 x 


q-30 


1.48 x 10- 29 


-3.96 x 10- 30 


R u 


-2.80 x 10- 24 


2.80 x lO" 24 


3.54 x 


q-24 


6.61 x 10- 24 


-1.76 x 10- 24 


R S2 


-6.25 x 10- 25 


6.24 x 10- 25 


7.89 x 


o- 25 


1.47 x 10- 24 


-3.89 x 10- 25 


R 83 


-1.85 x 10" 29 


1.85 xlO- 29 


2.34 x 


0-29 


4.37 x lO" 29 


-1.19 x 10- 29 


R M 


-4.29 x 10- 30 


4.29 x 10- 30 


5.42 x 


q-30 


1.01 x 10- 29 


-2.71 x 10- 30 


R 85 


-4.82 x 10- 23 


4.82 x lO" 23 


6.09 x 


q-23 


1.14 x lO" 22 


-3.02 x 10- 23 


R B6 


-1.07 x 10- 23 


1.07 x lO" 23 


1.35 x 


q-23 


2.52 x 10- 23 


-6.64 x 10- 24 


R 87 


-3.24 x 10- 28 


3.24 x lO" 28 


4.09 x 


q-28 


7.64 x lO" 28 


-2.07 x 10- 28 


R 88 


-7.42 x 10~ 29 


7.42 x 10- 29 


9.37 x 


0-29 


1.75 x 10- 28 


-4.67 x 10- 29 


R 89 


-1.58 xlO" 25 


1.58 x 10- 25 


2.00 x 


0-25 


3.73 x 10- 25 


-10.00 x 10- 26 


R 90 


-3.56 x 10~ 26 


3.56 xlO" 26 


4.50 x 


q-26 


8.40 x 10- 26 


-2.23 x 10- 26 


R 91 


-1.02 x 10- 30 


1.01 x 10- 30 


1.28 x 


q-30 


2.39 x 10- 30 


-6.57 x 10- 31 


R 92 


-2.40 xlO- 31 


2.40 x 10- 31 


3.03 x 


o- 31 


5.65 x 10- 31 


-1.53 x 10- 31 


R 93 


-2.74 x 10- 24 


2.73 x lO" 24 


3.45 x 


q-24 


6.45 x 10- 24 


-1.72 x 10- 24 


R 94 


-6.11 xlO" 25 


6.11 x lO" 25 


7.72 x 


o- 25 


1.44 x lO" 24 


-3.82 x 10- 25 


R 95 


-1.79 x 10- 29 


1.79 x 10- 29 


2.26 x 


Q-29 


4.22 x 10- 29 


-1.15 x 10- 29 


R 96 


-4.17 xlO" 30 


4.17 x 10- 30 


5.27 x 


q-30 


9.83 x 10- 30 


-2.64 x 10- 30 


R 97 


1.61 x 10-° 4 


-2.33 x 10-° 4 


-2.86 x 


Q-04 


-5.03 x 10-° 4 


-4.08 x 10-° 4 


R 98 


1.99 x 10-' 2 


-2.66 x 10 I2 


-3.37 x 


o- 12 


-6.11 x 10-' 2 


-5.21 x 10-' 2 


R" 


3.16 x 10-° 2 


-4.69 x 10-° 2 


-5.69 x 


q-02 


-9.94 x 10-° 2 


-8.04 x 10-° 2 


R 100 


3.72x10-'° 


-4.89 x 10- 10 


-6.23 x 


0-10 


-1.14 x 10-° 9 


-9.88 x 10-'° 


R 101 


1.21xl0-° 5 


-1.72 xlO- 05 


-2.12 x 


q-05 


-3.75 x 10-° 5 


-3.04 x 10" 05 
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Table 11. Consequent parameters of the Controller 1 - T-S FLC 1 (cont. 2 of 3) 



R 1 


Pi. 6 


Pl,7 


Pl,8 


Pl.9 


R= 2 


-4.34 x 


-(b 


4.09 x 10-"' 


1.69 x 


-06 


-5.51 x 10- Ui 


R 53 


-16.9 x 


-oo 


3.59 x 10- 01 


2.48 x 


O-oi 


-15.4 x nr m 


R 5A 


-6.11 x 


-oo 


1.89 x 10- 01 


-3.49 x 


q-02 


-3.61 x lO" 00 


R 55 


-1.40 x 


0-oo 


-1.77 x 10-° 2 


1.20 x 


O-oi 


-2.96 x lO -00 


R 56 


3.96 x 


Q-05 


-1.76 x 10-° 6 


1.37 x 


Q-06 


2.41 x lO" 06 


R S7 


-8.41 x 


0-oo 


-2.37 x 10-° 2 


5.48 x 


O-oi 


-14.9 x ur m 


R 58 


-1.15 x 


Q-02 


3.20 x 10-° 4 


9.54 x 


Q-06 


-7.85 x lO" 03 


R 59 


-7.79 x 


0-oo 


1.72 x 10" 01 


1.57 x 


O-oi 


-15.4 x ur m 


R 60 


-3.81 x 


q-05 


6.84 x 10-° 7 


8.25 x 


0-07 


-4.00 x lO" 05 


R 61 


-13.9 x 


0-oo 


1.81 x 10- 01 


4.43 x 


O-oi 


-16.5 x ur m 


R 62 


-1.21 x 


0-oi 


3.27 x 10-° 3 


3.10x 


q-04 


-8.68 x lO" 02 


R 63 


-10.7 x 


0-oo 


-3.12 x 10-° 2 


6.95 x 


O-oi 


-18.4 x ur m 


R 64 


-1.48 x 


Q-04 


2.45 x 10-° 6 


3.54 x 


q-06 


-1.53 xlO" 04 


R 65 


1.59 x 


o- 23 


-4.49 x 10- 25 


3.15 x 


o- 27 


1.07 x ur 23 


R 66 


3.50 x 


o- 24 


-9.91 x 10- 26 


5.28 x 


0-28 


2.35 x 10~ 24 


R 67 


1.08 x 


Q-28 


-3.05 x 10- 30 


3.49 x 


Q-32 


7.20 x lO" 29 


R 68 


2.45 x 


0-29 


-6.94 x 10~ 31 


5.90 x 


o- 33 


1.64 x 10~ 29 


R 69 


2.72 x 


o- 22 


-7.70 x 10~ 24 


4.84 x 


0-26 


1.83 x ur 22 


R 70 


5.97 x 


o- 23 


-1.69 x 10~ 24 


8.09 x 


o- 27 


4.01 x nr 23 


R 71 


1.87 x 


o- 27 


-5.30 xlO- 29 


5.38 x 


o- 31 


1.25 x 10~ 27 


R 72 


4.22 x 


q-28 


-1.19 x 10- 29 


9.08 x 


q-32 


2.83 x 10~ 28 


R 73 


9.03 x 


o- 25 


-2.56 x 10- 26 


2.12 x 


q-28 


6.05 x K)- 25 


R 74 


2.01 x 


0-25 


-5.69 x 10~ 27 


3.57 x 


0-29 


1.35 x 10~ 25 


R 75 


5.97 x 


Q-30 


-1.70 x 10- 31 


2.34 x 


o- 33 


3.99 x lO" 30 


R 76 


1.38 x 


Q-30 


-3.92 x 10- 32 


3.97 x 


0-34 


9.25 x 10~ 31 


R 77 


1.55 x 


o- 23 


-4.39 x 10- 25 


3.27 x 


o- 27 


1.04 x ur 23 


R 78 


3.44 x 


o- 24 


-9.72 x 10- 26 


5.48 x 


0-28 


2.31 x 10~ 24 


R 19 


1.04 x 


q-28 


-2.96 x 10~ 30 


3.61 x 


q-32 


6.98 x 10~ 29 


R 80 


2.39 x 


Q-29 


-6.77 x 10- 31 


6.12 x 


o- 33 


1.60 x nr 29 


R 81 


1.06 x 


o- 23 


-3.01 x 10- 25 


2.46 x 


o- 27 


7.14 x 10~ 24 


R 82 


2.37 x 


Q-24 


-6.70 x 10- 26 


4.14 x 


0-28 


1.59 x ur 24 


R 83 


7.06 x 


q-29 


-2.01 x 10- 30 


2.72 x 


q-32 


4.72 x 10~ 29 


R 84 


1.63 x 


q-29 


-4.62 x 10- 31 


4.61 x 


o- 33 


1.09 x 10~ 29 


R 85 


1.83 x 


o- 22 


-5.18 x 10- 24 


3.79 x 


Q-26 


1.23 x ur 22 


R 86 


4.05 x 


o- 23 


-1.14 x 10~ 24 


6.35 x 


o- 27 


2.72 x 10~ 23 


R 87 


1.23 x 


o- 27 


-3.50 x 10- 29 


4.20 x 


o- 31 


8.25 x lO" 28 


R 88 


2.82 x 


q-28 


-7.99 x 10- 30 


7.10x 


q-32 


1.89 x nr 2g 


R 89 


6.01 x 


Q-25 


-1.70 x 10- 26 


1.66 x 


0-28 


4.03 x 10~ 2S 


R 90 


1.35 x 


o- 25 


-3.83 x 10- 27 


2.79 x 


q-29 


9.07 x Ur 26 


R 91 


3.87 x 


q-30 


-1.10 x 10- 31 


1.82 x 


o- 33 


2.58 x 10~ 30 


R 92 


9.12 x 


o- 31 


-2.59 x 10- 32 


3.09 x 


0-34 


6.10 x 10~ 31 


R 93 


1.04 x 


o- 23 


-2.94 x 10- 25 


2.55 x 


o- 27 


6.96 x lO" 24 


R 94 


2.33 x 


o- 24 


-6.56 x 10- 26 


4.29 x 


0-28 


1.56 xlO" 24 


R 95 


6.82 x 


q-29 


-1.94 x 10- 30 


2.81 x 


q-32 


4.56 x lO" 29 


R 96 


1.59 x 


q-29 


-4.50 x 10- 31 


4.77 x 


o- 33 


1.06 x 10~ 29 


R 97 


3.23 x 


q-05 


-1.27 x 10-° 5 


2.58 x 


0-05 


-5.29 x 10" 04 


R 98 


-3.25 x 


o- 14 


-1.48 x 10 l3 


3.20 x 


o- 13 


-6.21 x 10~ 12 


R" 


9.58 x 


Q-03 


-2.61 x 10-° 3 


5.12 x 


q-03 


-1.05 x lO" 01 


R 100 


-1.01 x 


o-" 


-2.78 x 10-" 


6.02 x 


o- 11 


-1.15 xlO- 09 


R 101 


1.72 x 


0-06 


-9.30 x lO" 07 


1.92 x 


q-06 


-3.92 x lO" 05 
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Table 12. Consequent parameters of the Controller 1 - T-S FLC 1 (3 of 3) 



R> 


Pl,l 


Pl,2 


Pl,3 


Pl.4 


Pi. 5 


R m 


2.35 x 1(T 1J 


-2.98 x 


o- 13 


-3.74 x 10~ 13 


-6.76 x 


o- 13 


-4.01 x 10~ 1J 


R m 


2.13 x 10-° 3 


-3.12 x 


-03 


-3.80 x 10-° 3 


-6.65 x 


q-03 


-5.36 x 10- 03 


R 104 


3.11 x 10-" 


-4.27 x 


o-" 


-5.35x10-" 


-9.60 x 


o-" 


-8.11 x 10"" 


S 105 


2.47 x 10-° 4 


-3.58 x 


-03 


-4.38 x 10-° 4 


-7.71 x 


q-03 


-6.21 x 10-° 3 


R [06 


3.96 x 10-" 


-5.53 x 


o-" 


-6.88 x 10-" 


-1.27 x 


O-io 


-1.02 x 10-'° 


S 107 


4.53 x 10-°' 


-6.78 x 


O-oi 


-8.19 x 10-°' 


-1.43 x 


O-oo 


-1.14 x 10-°° 


R 108 


7.78 x lO" 09 


-1.09 x 


0-08 


-1.35 x 10-° 8 


-2.41 x 


Q-08 


-2.02 x 10~ 08 


R m 


1.93 x 10" 04 


-2.73 x 


0-04 


-3.37 x 10~ 04 


-5.97 x 


0-04 


-4.83 x lO" 04 


R no 


3.08 x 1(T 12 


-4.34 x 


o- 12 


-5.37 x 10 I2 


-9.55 x 


o- 12 


-7.88 x 10- 12 


R m 


3.10 x 10-° 2 


-4.53 x 


Q-02 


-5.52 x 10-° 2 


-9.68 x 


q-02 


-7.73 x 10~ 02 


R 112 


5.79 x 10- 10 


-8.20 x 


O-io 


-1.01 x lO" 09 


-1.80x 


q-09 


-1.49 x lO" 09 


S I13 


1.15 x 10~ 03 


-1.61 x 


0-03 


-2.00 x 10~ 03 


-3.56 x 


q-03 


-2.96 x 10~ 03 


R M 


1.35 x 10-" 


-1.91 x 


o-" 


-2.36 x 10-" 


-4.18 x 


o-" 


-3.45 x 10"" 


R 11S 


1.96 x 10-°' 


-2.69 x 


O-oi 


-3.36 x 10- 01 


-6.02 x 


O-oi 


-4.98 x 10" 01 


R m 


2.40 x lO" 09 


-3.38 x 


Q-09 


-4.18 x lO" 09 


-7.43 x 


q-09 


-6.12x10-°" 


R m 


8.52 x lO" 05 


-1.22 x 


0-04 


-1.50 x 10~ 04 


-2.65 x 


0-04 


-2.21 x 10-° 4 


R m 


1.05 x 10- n 


-1.47 x 


o- 12 


-1.82 x 10~ 12 


-3.22 x 


o- 12 


-2.55 xl0~ 12 


R m 


1.67 x 10-° 2 


-2.32 x 


q-02 


-2.88 x lO" 02 


-5.15 x 


q-02 


-4.28 x 10~ 02 


R no 


1.92 x 10- 10 


-2.70 x 


O-io 


-3.34 x 10-'° 


-5.94 x 


O-io 


-4.91 x lO" 10 


R m 


2.12 x 10~ 02 


-2.97 x 


q-02 


-3.69 x 10~ 02 


-6.57 x 


q-02 


-5.47 x 10~ 02 


R U2 


2.38 x 10 -10 


-3.36 x 


-io 


-4.16 x 10-'° 


-7.38 x 


O-io 


-6.11 x lO" 10 


R m 


3.66 x 10-°° 


-5.00 x 


O-oo 


-6.26 x lO" 00 


-11.2x 


O-oo 


-9.31 x lO" 00 


S 124 


4.24 x lO" 08 


-5.92 x 


0-08 


-7.36 x 10-° 8 


-1.31 x 


0-07 


-1.08 x 10-° 7 


R 125 


1.55 x 10-° 3 


-2.22 x 


Q-03 


-2.73 x lO" 03 


-4.83 x 


q-03 


-4.03 x 10~ 03 


S 126 


1.75 x 10-" 


-2.50 x 


o-" 


-3.08 x 10-" 


-5.45 x 


o-" 


-4.52 x 10-" 


R 127 


3.12 x 10-°' 


-4.33 x 


O-oi 


-5.39 xlO- 01 


-9.63 x 


O-oi 


-8.01 x lO" 01 


R 128 


3.42 x lO" 09 


-4.80 x 


q-09 


-5.95 x 10-° 9 


-1.06 x 


q-08 


-8.76 x 10~ 09 


S 129 


-6.95 x 10- 18 


6.95 x 


0-18 


8.77 x 10 I8 


1.64 x 


o- 17 


-4.41 x lO" 18 


R 130 


-1.58 x 10- 18 


1.58 x 


Q-18 


2.00 x 10~ 18 


3.73 x 


q-18 


-9.94 x 10~ 19 


jj131 


-4.32 x 10- 23 


4.32 x 


o- 23 


5.45 x 10~ 23 


1.02 x 


o- 22 


-2.83 x 10~ 23 


S 132 


-1.04 x 10- 23 


1.04 x 


o- 23 


1.32 x 10~ 23 


2.46 x 


q-23 


-6.69 x lO" 24 


S 133 


-1.21 x 10~ 16 


1.21 x 


0-16 


1.52 x 10~ 16 


2.84 x 


0-16 


-7.63 x 10~ 17 


S 134 


-2.72 x 10- 17 


2.72 x 


o- 17 


3.44 x 10 I7 


6.42 x 


o- 17 


-1.71 x 10~ 17 


R 135 


-7.70 x 10~ 22 


7.69 x 


o- 22 


9.71 x 10~ 22 


1.81 x 


o- 21 


-4.99 x 10~ 22 


R 136 


-1.83 x 10~ 22 


1.82 x 


o- 22 


2.30 x 10~ 22 


4.30 x 


o- 22 


-1.16 x 10~ 22 


R 137 


-3.86 x 10- 19 


3.85 x 


Q-19 


4.87 x 10-' 9 


9.08 x 


q-19 


-2.47 x 10~ 19 


R m 


-8.92 x 10- 20 


8.92 x 


Q-20 


1.13 x 10 I9 


2.10 x 


q-19 


-5.64 x lO" 20 


R m 


-2.28 x 10~ 24 


2.28 x 


o- 24 


2.88 x 10~ 24 


5.37 x 


o- 24 


-1.52x 10~ 24 


S 140 


-5.69 x 10- 25 


5.69 x 


o- 25 


7.18 x 10~ 25 


1.34 x 


Q-24 


-3.69 x lO" 25 


R 141 


-6.74 x 10~ 18 


6.74 x 


q-18 


8.51 x 10~ 18 


1.59 x 


o- 17 


-4.29 x 10~ 18 


R 142 


-1.54 x 10- 18 


1.54 x 


q-18 


1.95 x 10~ 18 


3.64 x 


q-18 


-9.72 x 10~ 19 


R 143 


-4.13 x 10- 23 


4.12 x 


o- 23 


5.20 x 10~ 23 


9.71 x 


q-23 


-2.72 x 10~ 23 


R 144 


-1.01 x 10~ 23 


1.01 x 


Q-23 


1.27 x 10~ 23 


2.37 x 


q-23 


-6.48 x 10~ 24 


R 145 


-4.56 x 10- 18 


4.56 x 


q-18 


5.75 x 10 I8 


1.07 x 


o- 17 


-2.92 x lO" 18 


S 146 


-1.05 x 10~ 18 


1.05 x 


q-18 


1.33 x 10-' 8 


2.48 x 


q-18 


-6.65 x 10~ 19 


S 147 


-2.71 x 10- 23 


2.71 x 


o- 23 


3.42 x 10~ 23 


6.38 x 


q-23 


-1.80 x 10~ 23 


S 148 


-6.74 x 10~ 24 


6.74 x 


0-24 


8.51 x 10~ 24 


1.59 x 


q-23 


-4.37 x 10~ 24 


S 149 


-7.97 x 10~ 17 


7.96 x 


o- 17 


1.01 x 10~ 16 


1.88 x 


0-16 


-5.07 x 10~ 17 


R 150 


-1.82 x 10-' 7 


1.82 x 


o- 17 


2.30 x 10~ 17 


4.29 x 


o- 17 


-1.15 x 10~ 17 


jjlSl 


-4.90 x 10~ 22 


4.89 x 


0-22 


6.17 x 10" 22 


1.15x 


-21 


-3.22 x 10" 22 
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Table 13. Consequent parameters of the Controller 1 - T-S FLC 1 (cont. 3 of 3) 



gf 


Pi. 6 


Pi. 7 


Pl.S 


Pi. 9 


R m 


-2.35 x 


10" 13 


-5.98 x IO -0 


2.73 x IO" 14 


-7.04 x IO" 13 


R 103 


5.05 x 


10 -04 


-1.70 x IO" 04 


3.42 x IO" 04 


-7.03 x IO" 03 


R 1M 


2.34 x 


io- 12 


-2.39 x IO" 12 


5.02 x 10~ 12 


-9.86 x 10-" 


R l0S 


4.37 x 


10 -04 


-1.93 x io-° 4 


3.94 x IO" 04 


-8.11 x IO" 03 


R 106 


4.52 x 


IO" 12 


-3.07 x IO" 12 


6.38 xl0~ 12 


-1.27 x 10" 10 


R m 


1.43 x 


io- m 


-3.72 x IO" 02 


7.30 x IO" 02 


-1.52 x IO" 00 


R im 


1.14 x 


10 -09 


-6.13 x ur 10 


1.26 x 10~ 09 


-2.50 x 10-° 8 


R m 


1.96 x 


io- 05 


-1.46 x 10~ 05 


3.05 x IO" 05 


-6.24 x 10-° 4 


R m 


3.68 x 


io- 13 


-2.38 x IO" 13 


4.93 x 10~ 13 


-9.93 x IO" 12 


R m 


6.43 x 


io- 03 


-2.43 x 10-° 3 


4.93 x IO" 03 


-1.02 x IO -01 


R m 


9.48 x 


io- 11 


-4.56 x IO" 11 


9.33x10-" 


-1.87 x 10-° 9 


R 113 


1.36 x 


IO" 04 


-8.89 x Ur 05 


1.85 x IO" 04 


-3.69 x 10-° 3 


R 114 


1.89 x 


io- 12 


-1.05 x ur 12 


2.16 x 10~ 12 


-4.35 x IO"" 


R 115 


7.18 x 


io- 04 


-1.44 x IO" 02 


3.10 x IO" 02 
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Abstract. This paper describes the design of a locomotion controller to regulate the 
pose of a nonholonomic mobile robot with frontal differential driving wheels, and 
a swivel castor wheel in rear. Differently to common proposals base on dynamic 
models, this controller uses the kinematics model and the mobile robot architecture 
to generate the adequate linear and angular speeds to reach the desired pose. Two 
independent and coordinated fuzzy inference systems are the core of this develop- 
ment that is a step further in autonomy in the sense of independence. Experiments 
and results are shown. 

1 Introduction 

The field of autonomous navigation of mobile robot (MR) has been a focus of re- 
search interest during many years for several reasons: usually, because they are very 
useful when humans are not able to reach certain target because of terrain condi- 
tions, or they could be in danger; particularly, for scientist because of its interdisci- 
plinary nature entails many complex challenges that can be tackled from different 
points of view. 

The autonomy adjective glued to the word navigation confers an intelligent be- 
havior of the MR, and it is related with its ability to sense the situation and act on it 
appropriately, the gained benefit comes along with the necessity to design the robot 
in a way that it is able to respond to a list of complex situations, that includes at least 
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the ability to navigate autonomously, avoiding modeled and unmodeled obstacles 
especially in crowded and unpredictably changing environment. The intelligence of 
a robot in the sense of independence will depend on how much and quickly it can 
sense its environment and itself, as well as the capacity to create its own control law; 
thus, an MR with such capacities must have at least sensors, effectors/actuators, 
a locomotion system, the on-board computer system, and a set of controllers to 
coordinate all the components to achieve the desired task in the desired way 1T81 . 

This work falls in the robot control area that refers to the way in which the sensing 
and action of a robot are coordinated. There are several approaches to control a MR, 
being the best known ifTTll : 

• Reactive Control. It is a technique that produces timely robotic response in dy- 
namic and non-structured worlds. The idea can be summarized as "Do not think, 
just react". This idea tightly couples sensing and actions. Learning is not an 
objective of this technique O El El . 

• Deliberative Control. This is a model based on sensing, planning and action. The 
planing stage slows down the process |[T6l lf2TTl . 

• Hybrid Control. Combines the two extremes of reactive and deliberative 
systems 0] iflOt 

• Behavior-Based Control. The basic idea is to subdivide the navigation task into 
small and easy to manage programs behaviors that focus on concurrently execu- 
tion of specific subtasks. It uses a "divide and conquer" strategy that makes the 
system modular lfl4l |[T8l l20l. 

Being more specific, we present a controller for the locomotion system that can be 
used in any of the aforementioned approaches to control a nonholonomic differen- 
tial MR with frontal traction and swivel castor wheel in rear. Differently to common 
methods that require an accurate dynamic model that involves the interacting forces, 
and the design of algorithms to directly control the motors torque that difficult im- 
plementation since they are based on unrealistic models, this proposal uses the kine- 
matics model and the mobile robot architecture to generate a fuzzy knowledge base, 
in order to obtain the adequate actions to achieve the desired pose without the need 
to know a complex unrealistic mathematical model. 

A detailed analytical study of the structure of the kinematic and dynamic models 
of wheeled mobile robots can be found in [6|. Wheeled mobile robots constitute 
a class of mechanical systems called nonholonomic mechanical systems [ 8 ] char- 
acterized by kinematic constraints that are not integrable and cannot, therefore, be 
eliminated from the model equations. Nonholonomic behavior in robotic systems 
is, particularly interesting because it implies that the mechanism can be completely 
controlled with reduced number of actuators. 

Fuzzy logic approaches to mobile robot navigation and obstacle avoidance have 
been investigated by several researchers, because they have the ability to treat un- 
certain and imprecise information using linguistic rules, thus, they offer possible 
implementation of human knowledge and experience. Li and Yang [7], proposed an 
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obstacle avoidance approach using fuzzy logic where the input sensors are 
separately inferred. Lee and Wang [ 13 1 proposed a collision-avoidance approach us- 
ing fuzzy logic, where different modules, such as avoiding-static-obstacle module, 
avoiding-moving-obstacle module, and directing-toward-target module, are created 
for the robot navigation. However, these modules are separately inferred and are not 
as coordinated as human reasoning. Xu and Tso [22|, proposed a reactive behavior- 
based fuzzy logical controller, which utilizes the rules to define the robot reaction to 
unknown environments and applies fuzzy inference to coordinate different reactive 
behaviors. Saffiotti reviewed some of the proposals in the literature about the uses of 
fuzzy logic in autonomous robot navigation and discussed the pros and cons of fuzzy 
logic solutions in 1 3 1 . He pointed out that if only weak knowledge is available, then 
fuzzy logic may provide a more adequate tool, otherwise, the stronger techniques, 
such as classical control or probability theory, should probably be used [2 1. Saffiotti 
also proposed some fuzzy logic methods for robot navigation; however, these meth- 
ods cannot guarantee that the robot will not be trapped on local minima or infinite 
loops. 

This work has five main sections, the first section corresponds to the present 
introduction. Section two briefly outlines the kinematics of a differential-drive non- 
holonomic MR to obtain the equations used in this development. In section three 
the locomotion system for controlling the pose of the MR is explained. Two repre- 
sentative experiments and results are given in section four. Finally, the conclusions 
are given in section five. 

2 Kinematics of a Differential-Drive MR 

Fig. 03 shows the structure of an MR with two connected traction wheels (differen- 
tial) in the front, and one unpowered swivel castor wheel in rear. This MR belongs 
to a large class of mechanical nonholonomic systems described by the dynamic 
equation given in (Q]) based on the Euler Lagrange Formulation HI . 

M{q)q + V{q,q)q + F(q) + G{q) + T d = B{q)T-A T {q)X (1) 

where q is the n dimensional vector of configuration variables, M(q) £ R" x " is 
a symmetric positive definite inertia matrix, V(q,q) £ R" x " is the centripetal and 
coriolis matrix, F(q) £ R" xl denotes the surface friction, G(q) £ R is the grav- 
itational vector, Tj denotes bounded unknown disturbances including unstructured 
unmodeled dynamics, B{q) £ W ixr is the input transformation matrix, x £ R" xl 
is the input vector, A (q) £ R mx " is the matrix associated with the constraints, and 
X £ R mx l is the vector of constraints forces. 
The nonholonomic constraint is given by, 

A(q)q = (2) 
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Fig. 1. A nonholonomic mobile robot. The reference point C can be shifted a distance d, i.e, 
the point Cj ; in our case, d = 0. 



Defining S(q) as a Jacobian matrix that transforms velocities v in mobile base coor- 
dinates, to velocities q in Cartesian coordinates, where S(q) is as a full rank matrix 
(n — m) formed by a set of smooth and linearly independent vector fields spanning 
the null space of A (q), 



S'(q)A J (q)=0 



(3) 



According to expressions Q and ©, it is possible to find an auxiliary vector time 
function v(t) e R^ m '> such that, for all t, 



q = S(q)v(t) 



(4) 



This MR is analyzed as a rigid body on wheels operating on a horizontal plane; by a 
rigid body we refer to the robot chassis ignoring the degrees of freedom (DOF) and 
flexibility due to the wheel axes, wheel steering joints, and the swivel caster wheel. 
The total of DOF of this robot chassis on the plane is three, two for position in the 
plane (X, Y), and one for the orientation 6. however, only two DOF are controllable, 
it can go to the front or to the rear applying equal speed to the driving motors, the 
angle is obtained by the application of differential speed, hence we have only two 
controllable DOF. This MR is nonholonomic because the number of controllable 
DOF is smaller than the existing. The nonholonomic constraint states that the robot 
can only move in the direction normal to the axis of the driving wheels, i.e., the 
mobile base satisfies the conditions of pure rolling and nonslipping j3) (9), 



y c cosd — x c sin G — dd = 



(5) 
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It is necessary to use two different reference frames: A global reference coordinate 
system for the position and a robot local reference frame. Hence, to specify the po- 
sition of the MR on the plane it is indispensable to establish a relationship between 
the global reference frame of the plane an the local frame of the robot, as it is il- 
lustrated in Fig. [TJ for this purpose, the axes X and Y define an arbitrary inertial 
basis on the plane as the global reference frame from some origin O : {X,7}. To 
specify the position of the MR, choose a point C on its chassis as a reference point 
that can be shifted a distance d from the driving wheels axis which is marked as 
Cd- The robot's local reference frame is defined using the basis {Xr, Yr} that defines 
two axes relative to C on the robot chassis [ 19']. In this way the position of C in the 
global reference frame is given by coordinates x c and y c , and the angular difference 
between the global and local reference frames is given by 0; using these three values 
the pose of the robot is defined by, 



q= (x c ,y c ,ey 



(6) 



Using (Jffl) we can write the next equations 



cos 9 


—dsinO 


sinQ 


dcosQ 





1 



(7) 



where, 



hence, 



because d — in Fig.Q] 



v = 



cosQ —dsinQ 




r -, 


sinO dcosO 
1 




V 
(0 



cose o 

sin.B 
1 



(8) 



(9) 



(10) 



To describe robot motion in terms of component motion, it is necessary to map 
motion along the axes of the global reference using a transformation matrix and the 
speed components. Note that speed components are the linear and angular speeds, 
and the transforming matrix S(q) for d = is given by, 



S(q) = 



cose o 

sine 
1 



(11) 



In this proposal of pose controller, it is necessary to know the actual pose q, and the 
desired pose q c i to calculate the pose error; hence, we need to obtain q from ( fTob . 
that is easily achieved by integrating q. 
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For the robot of Fig. Q] we assume that the diameter of each wheel is r, the point 
C is centered between the two drive wheels, each wheel is a distance / from C to the 
wheel; so, given r, I, 0, and the spinning speed of each wheel, <p\ and <p2, a forward 
kinematic model must be designed to predict the robot's overall speed in the global 
reference frame; so, we can write expression dT2l) 



:/(Z,r,e,<Pi,<?>2). (12) 



The robot of Fig.[T|is aligned along +Xr and the robot will move along this axis, C is 
halfway between the two wheels, each wheel is contributing with a half of the total 
robot's speed xr, therefore we have xr = x r \ +x r 2, where x r \ = \r<fi\ (right wheel) 
and x r 2 = \rtyi (left wheel) are the speed of each wheel. For a differential robot 
in which each wheel spins with equal speed but in opposite directions the result 
is a stationary spinning movement and the xr will be zero, and because neither 
wheel can contribute to make sideways motion in the robot's reference frame, jr 
is always zero. Finally, it is necessary to calculate the rotational component Or of 
qR. Since each wheel contributes additively for the final displacement, forward spin 
of the right wheel combined with the opposite spinning of the left wheel results in 
counterclockwise rotation at point C, similarly a clockwise rotation can be obtained 
using the opposite spinning of the wheels. If right wheel spin alone, the robot pivot 
around left wheel, and viceversa. For the counterclockwise case, the rotation velocity 
(0\ at C can be calculated because the wheel is instantaneously moving along the arc 
of circle of radius 21, 

«i = ^ (13) 

For the left wheel, to obtain a clockwise rotation at point C, we have, 

a* = § (14) 

The kinematic model provides information about the motion of a robot using its 
component wheel speeds for simple cases. However, to determine the space of pos- 
sible motions for a determined chassis design, it is necessary to describe formally 
the constrains of the robot that depends on type of each wheel. In order to sim- 
plify the model several assumption must be considered; so there are two important 
constraints: the first one enforces the concept of rolling contact, this means that the 
wheel must roll when motion takes place in the appropriated direction; the second 
enforces the concept of no lateral slippage which means that the wheel must no 
slide orthogonal to the wheel plane. It was demonstrated in [ 19 1 that for an MR with 
M wheels, only the fixed standard wheels and steerable wheels have impact on the 
robot chassis kinematics requiring consideration when computing the robot's kine- 
matics, on the other hand, the castor wheels impose no kinematics constrains on the 
robot chassis. 
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3 Locomotion System Controller 



In this section, the locomotion system controller to regulate the posture of the 
differential MR of Fig. Q] is explained. The controller's goal is to make that the 
difference between the desired pose and the actual pose trend to zero, i.e., 



lim \\q d ~q(t)\\ =0 



(15) 



Fig. |2] shows the controller block diagram. Basically, it consists of two kind of feed- 
back controllers: The Proportional Integral (PI) "DC motor controllers" of the two 
drive wheels; and the "fuzzy pose controller" associated with the kinematic model, 
which sends, as the reference for the PI controller, an estimate value of the adequate 
angular speed for each wheel to achieve the desired goal. 
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Fig. 2. Fuzzy controller to regulate the Pose of a differential mobile robot 



Each drive wheel of the MR uses a Pittman GM9236S025 DC Motor equipped 
with a shaft optical encoder to implement the feedback loop. Fig. [3] illustrates the 
main electro-mechanical components used to drive the DC Motor: For a desired 
speed, and rotation way (CW or CCW), the controller board calculates the duty cycle 
to generate the Pulse Width Modulation (PWM) signal in order to send it to the H- 
bridge together with its configuration to obtain the desired rotation way; then, each 
H-bridge associated with its corresponding motor provides polarization and gives 
enough power to drive the motor. As it was mentioned, the motor speed is regulated 
using a PI controller, to calculate it, a "black-box" mathematical model [ 12 1, that in- 
volves the applied power, H-bridge, and DC gearmotor was obtained. For this task, a 
system identification process was achieved, we used a sampling time of 7i = 100 ms 
and 500 samples, the linearized model (transfer function) is shown in ( fToT l. a 
78.19% of model accuracy was obtained, which from the practical point of view, 
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Fig. 3. Power stage to drive a DC gearmotor using PWM and H-Bridge 

it was enough to achieve the PI controller design and simulations. Fig. 2] shows a 
comparison between the real system and the obtained model. The PI control law is 
given by O, where K p = 0.008 and K t = 0.06. 




Fig. 4. A pseudorandom sequence to achieve the system identification process was used 



G(s) 



351.9 
5 + 4.933 



(16) 



K(s) = K p 



Ki(s) 



(17) 



In Fig. |2j the external control loop contains the pose fuzzy controller. This block 
handles three inputs and two outputs; to simplify, the block uses two fuzzy inference 
systems (FIS): one FIS for handling position in the (X,Y) plane (FIS1), and one to 
control the angle of orientation 9 of the MR (FIS2), see Fig. [5] 
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Fig. 5. Fuzzy systems of the locomotion pose controller. The two FIS infers the reference 
angular speed for right and left wheels. 

3.1 Fuzzy Controller for Pose Regulation 

The objective of this controller is to provide the appropiated reference speeds to take 
the MR from the actual position and orientation to the target pose, Fig. [5] shows with 
more detail the block named "Fuzzy Controller for Pose Regulation", where there 
are two FIS sub-blocks, at the top is the FIS 1 for position control, at the bottom 
is the FIS2 for orientation control. The output of the two FIS are combined and 
synchronized to obtained the required angular speeds of each drive wheel. 



3.1.1 FIS1 for Position Control 

This FIS has two linguistic variables as inputs: the distance "dist" from the actual 
position to the target position, and the alpha angle "a" situated in the reference 
frame, which represents the angle of deviation of the actual position to the target 
position, see Fig. [6] FIS1 has two linguistic variables as outputs: linear speed "v r ", 
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Fig. 6. The MR is disoriented the angle a. First, the MR needs to orientate to the desired 
position, (i.e., a = 0), then it will reach the target position. Finally, it will get the right 
orientation. 



and angular speed "a),", the inference is achieved using the Mamdani approach, 
Table [T] shows the rule base. 

The linguistic variable "distance" has two linguistic terms: Small (S) and Large 
(L) with triangular and trapezoidal shape, respectively; they were defined in the uni- 
verse of discourse [0,4], see Fig. [Taj The numeric value at this input is the euclidian 
distance dist obtained by (fT8l l. 



I I'M I = ^2 + e 2 = ^(x d -x) 2 + (y d -y) 2 



(18) 



Table 1. Fuzzy rule base of FIS1 
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The linguistic variable "alpha" is the angle of orientation that the MR forms be- 
tween the straight line of actual and target position, the input numeric value "a"is 
calculated using ( TT9b , 
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(a) FIS1 input. LV "distance". 
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(b) FIS1 input. LV variable "alpha". 
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(c) FIS1 output. LV "linear speed". 
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(d) FIS 1 output. LV "angular speed". 



Fig. 7. Position controller. Linguistic variabels (LVs) and terms for inputs and outputs of 
FIS1. 



a = atan2(e y ,e x ) — 9 



(19) 



The variable "alpha" has five linguistic terms: Negative Big(NB), Negative Medium 
(NM), Zero (Z), Medium Positive (MP), and Large Positive (LP). They are repre- 
sented by three triangular membership function (MF), and two trapezoidal MFs de- 
fined in the interval [—2jt, 1n\ radians to represent a whole counterclockwise (CCW) 
turn, or a clockwise (CW) turn. Fig. [7b] shows the MFs of each term. 

FIS1 has two outputs, v r and ft),-. The linguistic variables are "linearSpeed" and 
"angularSpeed", that provide an estimate of linear and angular speed, their MFs and 
terms are shown in Fig.|7c]and Fig.[7d] respectively. 

3.1.2 FIS2 for Orientation Control 

The goal of this Mamdani FIS is to face the MR in the desired pose, it uses one 
linguistic variable as input, and one as output. Fig. [8a] shows the input linguistic 
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variable "theta error" with its linguistic terms: Negative Big (NB), Negative Medium 
(NM), Zero (Z), Positive Medium (PM), and Positive Big (PB). Fig. l8bl shows the 
output linguistic variable and terms, note that term names and ranges are the same 
of Fig|7d]for FIS 1 . Table |2] shows the rule base. The "theta error" at time / is given 
by dH). 



) {t) = e d {t)-e(t) 



(20) 
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(a) FIS2 input. LV "theta error" 



(b) FIS2 output. LV "angular speed". 



Fig. 8. FIS2. Orientation control. 



Table 2. Fuzzy rule base of FIS2 
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3.1.3 Output Conversion and FIS Synchronization 

Fig. [2] shows that the outputs of the fuzzy controller for pose regulation" are refer- 
ence values of angular speeds for the right wheel "ft) rr " and left wheel "0)/ r " that the 
PI controllers will use as input reference to control the angular speed of each wheel. 
Note that FIS 1 nor FIS2 provide directly these speeds, so we used (f2TT> and (1221 in 
order to obtain the desired angular speeds. 



(0„ 



v r + (b/2)co r 



(21) 
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v r ~(b/2)0) r 

COlr = (22) 

r 

The algorithm implemented in the Fuzzy controller block for pose regulation has 
two steps synchronized with the help of the multiplexer (MUX) that is shown in 
Fig. 12 they are explained using Fig|6l The steps are: 

1 . Step 1. Using FIS 1 , the MR is oriented from the actual position and orientation 
to the target position regardless final orientation, in this step the MR rotates the 
a angle showed in Fig.0 and it moves to the target position. 

2. Step 2. Once the MR has reached the target position, using FIS2 the MR is 
oriented according the target pose, this is illustrated in Fig|6]as the angle Oj. 



4 Experimental Results 

We present two representative experiments that were performed. The basic idea is to 
propose the initial and target poses. The general characteristics for both experiments 
are the next: 

1. The drive wheel of the MR has a radius r = 0.08 meters. 

2. The distance between the drive wheels is b = 0.45 meters. 

3. The MR platform has a reference point C displaced a distance d — from the 
driving wheels. 

4. The initial pose is q = (0.1,0.1,0) r . 

Experiment 1. 

The MR wants to reach the pose given by 

^/ = (0.9,2,0.5) r (23) 

The steps explained in section U. 1.31 are followed. Fig. [9a] and Fig|9blshow the time 
in seconds that the MR lasted to orientate and arrive to the desired position (target), 
this process was achieved using FIS 1 . Fig.|9c]shows the time and maximal error that 
the MR lasted to achieve the desired orientation. Fig. llOal Fig. llObl and II Pel show 
the error evolution in time for positioning the MR in the desired position (xd, yd) 
and orientation Qj. 

Experiment 2. The MR wants to reach the pose given by 

q d = (2,0.5, ^) T (24) 
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Fig. 9. Experiment 1. Evolution in time of the MR to achieve the target pose qj 
(0.9,2,0.5) r . 



Similarly to experiment 1 , we followed the algorithm explained in section 13.1.31 
Fig. Illal and Fig lllbl show the time that the MR lasted to orientate and arrive to 
the desired position (target), this process was achieved using FIS1. Fig. II lcl shows 
the time and maximal error that the MR lasted to achieve the desired orientation. 
Fig. ll2al Fig. ll2bl and !12cl show the error evolution in time for positioning the MR 
in the desired position [xd,yd) and orientation Q c \. 
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Fig. 10. Experiment 1. Initial pose qo = (0.1,0.1,0) r , target pose q c [ = (0.9,2,0.5) 
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5 Conclusions 



The development of a locomotion controller to regulate the pose of a MR that uses 
a Fuzzy Control block as the core of the controller was presented. The proposed 
controller works with the kinematic model and the MR architecture, this charac- 
teristic allows to implement the controller without the need of using the system dy- 
namic model. A controller requirement is to know the actual position and orientation 
(pose), and the desired pose in order to reach it. It was supposed that the MR has 
the adequate onboard equipment to provide the controller model with the different 
positions and orientation when it is required. Computer simulations confirms that 
the performance of the locomotion controller satisfied the controller's goal imposed 
by lim t ^,ov\\qd — q(i)\\ = 0, with a no significant error with a fast time response for 
the selected motors. This controller is a step further in autonomy in the sense of 
independence since its core is a fuzzy system based on rules derived from experi- 
ence, instead of using analytical mathematical models to generate the control law. 



462 O.M. Ross et al. 

Portability is an important characteristic of this proposal since modifications to the 
MR architecture impose only small program changes. 
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Abstract. This work deals with the embedding architecture into an FPGA of a fuzzy 
locomotion controller for pose regulation of a differential nonholonomic mobile 
robot. It is presented an standardized design based on the actual and target pose, 
it does not need any dynamic model to work, the design provides the estimate an- 
gular speeds, then using the kinematic model, a feedback of the actual position is 
provided, hence the same system can be used by different mobile robots consider- 
ing that the speed control of driving wheels is a subsystem and a configuration is 
provided. These features makes this proposal viable to be used by the automotive 
industry in the automatic steering system for self-parking for different car mod- 
els with differential tracking. The controller was developed using VHDL code, its 
functionality is simulated by merging the code into the Simulink environment. The 
experimental framework, experiments and results are explained. 

1 Introduction 

During several years, the problem of motion planning of nonholonomic robots have 
been studied from different perspectives because they offer several interesting prob- 
lems to researchers (U El. In most of the cases, a nonholonomic mobile robot (MR) 
has a limited steering angle, rectangular shape, frontal or rear driving wheels; in gen- 
eral, it has similar features than a car. Recently, automotive industry also has put its 
attention to this kind of MR, such is the example of new cars with automatic steering 
for self parking. This work is focused in the development of a locomotion controller 
for pose regulation embedded into an FPGA, that works with the kinematic model, it 
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does not need any complicated mathematical dynamic model since its functionality is 
based on the system architecture. This controller is a step further in autonomy in the 
sense of independence because its core is a fuzzy system based on expertise knowl- 
edge. The controller only needs to know the actual pose (position in x, y and the angle 
of orientation) and the target pose. Embedding a controller with such characteristics 
offers to designers the possibility of developing an standard dedicated computer for 
controlling cars movement; i.e., nonholonomic mobile robots Q. 

There are several previous works that deal with this problem using different ap- 
proaches, in Q was presented a partial listing of C-code for solving the problem 
known as "truck backer-upper control" for implementation into a reconfigurable 
FPGA system. In Q, a new version of the Xfuzzy design environment is presented 
using as an application example, the self parking of a car problem. In [10], the 
design and synthesis of a mobile robot controller using fuzzy logic for reactive 
control was presented. Other works that use fuzzy logic in this field are lfT4ll lfl5l . 
The proposal presented in this work is fairly different to previous works in several 
aspects. 

This work has five main sections, the first section corresponds to the present 
introduction. In section two the locomotion system is described, this section begins 
explaining the kinematic model, following with a description about the instances 
used to build the Fuzzy Pose Controller (FPC) for FPGA implementation. In section 
three, the experimental platform to test the proposal is explained. In section four, the 
experiments and results are commented. Finally in section five are the conclusions. 

2 Locomotion Controller of a Nonholonomic Wheeled MR 

In Fig.Q]the structure of a differential nonholonomic MR with two connected trac- 
tion wheels in the front and one unpowered swivel castor wheel in rear is shown. 
In this figure, there are two different reference frames: the global reference frame 
{X,F} to position the MR in the world from some origin O : {X,T}, and the 
robot's local reference frame {Xr,Yr} that maps a reference point C in the robot's 
chassis considering its orientation. In this way, the point C in the global frame is 
given by coordinates x c and y c , and the angular difference between global and lo- 
cal frames is given by 6, using these three values the pose of a robot is defined 

by, 

q={x c ,y c ,df (1) 

The robot of Fig. [T]has three degrees of freedom in the plane, x c and y c for position, 
and 6 for orientation; however, only two are controllable, it can go to the front 
or to the rear applying equal speed to the driving motors, by the application of 
differential speeds the angle is obtained. The nonholonomic constrains states that 
the robot can only move in direction normal to the axle of the driving wheels, the 
slipping condition is not allowed. In Fig.[2]the locomotion controller to regulate the 
posture of an MR such as the shown in FigQ] is illustrated. Basically, it has two kind 
of feedback controllers: The Proportional Integral (PI) "DC gearmotor" controllers 
of the two drive wheels, and the Fuzzy Pose Controller (FPC) associated with the 
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Fig. 1. A nonholonomic mobile robot. The reference point C can be shifted a distance d, i.e, 
the point Q ; in our case, d = 0. 
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Fig. 2. Fuzzy locomotion system to regulate the Pose of a differential mobile robot 



kinematic model. The objective of the FPC is to estimate the angular speed values 
for each drive wheel that will be used by the PI controllers as the reference input. 
The goal of the locomotion controller is given by, 



\\.m\\q d -q(t)\\ = 



(2) 



Each wheel of the MR uses a Pittman GM9236S025 DC gearmotor equipped with 
a shaft quadrature optical encoder to implement the feedback loop, the speed is 
regulated using Pulse Width Modulation (PWM) and the power stage is handled 
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by an H-bridge. The desired speed and rotation way, clockwise (CW) or 
counterclockwise (CCW), is calculated by the onboard controller that sends the ap- 
propriated PWM signal and H-bridge configuration. To calculate the PI controller 
was necessary to obtain a "black-box" mathematical model [11), that involves the 
applied power (produced by the PWM signal), H-bridge, and DC gearmotor. To 
achieve the system identification, a sampling time of T s = 100 ms and 500 sam- 
ples were used, the linearized model (transfer function) is shown in (O, a 78.19% 
of model accuracy was obtained, which from the practical point of view, it was 
enough to achieve the PI controller design and simulations. The PI controller law is 
given by Q, 



5 + 4.933 



u(t) = K p e(t) 



Kt 



e(t) 



(4) 



where K p = 0.008, and K\ = 0.06. The output of each PI controller is the duty cycle 
of the PWM signal. The actual angular speed of each drive wheel is calculated 
using the optical encoder, it serves as the controller feedback as well as input to the 
matrix S(q) that transforms velocities v in the MR base coordinates, to velocities q 
in Cartesian coordinates. The speed is calculated using )5) |6| 1 13), 



q = S(q)\(t) 



(5) 



where the matrix S(q) for a nonholonomic MR with a reference point C positioned 
in the center of the drive wheel axle (i.e., d — 0) is given by, 



therefore, 



cose 

sinO 

1 



S(q) 



cose o 

sine 
1 



where v 



and q 



(6) 



(7) 



The locomotion pose controller uses the "pose error" signal denoted by e q , 



e q {t)=qd-q{t) (8) 

where, q is obtained integrating equation 0. Fig. [3] shows the FPC architecture 
with more detail. It consists of two Fuzzy Inference Systems (FIS 1 and FIS2), one 
multiplexer (MUX), and two functions, equations (0 and (fTOb . that converts linear 
and angular speeds to angular reference speeds for right (O rr , and left (0[ r , wheels. 
The MUX and its selector input, coordinates the two FIS actions to reach the de- 
sired pose; hence, the selector input allows to achieve the next two coordination 
steps: 
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Step 1. Using FIS1, the MR is oriented from the actual position and orientation 
to the target position regardless final orientation, in this step the MR rotates the 
a angle showed in Fig.lU and it moves to the target position. 
Step 2. Once the MR has reached the target position, using FIS2 the MR is 
oriented according to the target pose, this is illustrated in Fig.|4]as the angle Q c \. 



(Q„ 



Oil r 



V r + (b/2)(O r 



V r -(b/2)C0r 
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(10) 



Fuzzy Pose Controller (FPC) 



distanos(8;1 ) vsal(8: 1 ) 

alpha{8:1) FIS1 coded in VHDL 

, 6 (mamdani) 



wsal(S:1) 



IV 



thetaError(8:H 



al(S:1) 



elk 
rat 



FIS2 coded in VHDL 
(mamdani) 



Ul r 



Selector 



0,,-fJ 



a = nt(in2{-) 



.,*,,* 



(■) 



6 



— ► 



< from J q 



q,i = (;c,i, y,i, Oj) 

Fig. 3. Fuzzy systems of the locomotion pose controller. The two FIS infers the refer- 
ence angular speed for right and left wheels. Signal ce, elk and rst appear not connected, 
however they need to be connected. Signal ce enables or disables the instances of FIS1 
and FIS2. Signal rst provides initial conditions to the instances (hardware) in the FPGA. 
Signal elk synchronizes the different stages of the FIS: fuzzification |12|, inference [9| and 
defuzzification [4). 
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Fig. 4. The MR is disoriented the angle a. First, the MR needs to orientate to the desired 
position, (i.e., a = 0), after that, it will reach the target position. Finally, it will get the right 
orientation. 

Table 1. Position controller (FIS1). It has two inputs and two ouptuts 



Linguistic variable 


Linguistic terms 


FIS input 


FIS output 


Name 


Type 


Parameters 


distance 




Small (S) 


Triangular 


[0,0,0.5] 


Large (L) 


Trapezoidal 


[0,0.5,4,4] 


alpha 




Negative Big (NB) 


Trapezoidal 


[-n,-n, -1,-0.5] 


Negative Medium (NM) 


Triangular 


[-1,-0.5,0] 


Zero (Z) 


Triangular 


[-0.5,0,0.5] 


Positive Medium (PM) 


Triangular 


[0,0.5,1] 


Positive Big (PB) 


Trapezoidal 


[0.5 A, k, k] 




linear speed, v r 


Negative Big (NB) 


Trapezoidal 


[-0.8,-0.8,-0.2,-0.1] 


Negative Medium (NM) 


Triangular 


[-0.2,-0.1,0] 


Zero (Z) 


Triangular 


[-0.1,0,0.1] 


Positive Medium (PM) 


Triangular 


[0,0.1,0.2] 


Positive Big (PB) 


Trapezoidal 


[0.1,0.2,0.8,0.8] 




angular speed, ft) r 


Negative Big (NB) 


Trapezoidal 


[-3,-3,-1,-0.5] 


Negative Medium (NM) 


Triangular 


[-1,-0.5,0] 


Zero (Z) 


Triangular 


[-0.5,0,0.5] 


Positive Medium (PM) 


Triangular 


[0,0.5,1] 


Positive Big (PB) 


Trapezoidal 


[0.5,1,3,3] 



FIS1 for Position Control 

This FIS has two linguistic variables as inputs: the distance dist from the actual 
position to the target position, and the alpha angle a situated in the reference frame, 
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which represents the angle of deviation of the actual position to the target position, 
see Fig. [4] FIS1 has two linguistic variables as outputs: linear speed v r , and angular 
speed ft),-, the inference is achieved using the Mamdani approach, in 13 J4| |[T2ll 
is explained how to code in VHDL the different stages of a FIS, i.e.; fuzzification, 
inference, and defuzzification. The parameters of linguistic variables and terms are 
described in TableQ] in Table [2]fhe rule base is shown. 

The numeric value of the input linguistic variable dist is the euclidian distance 
obtained by (fTTT) . 

| \dist\\ = ^e\ + e] = ^{x d - xf + (y d - y) 2 (11) 

The linguistic variable a represents the angle of orientation that the MR forms be- 
tween the straight line of actual and target position, the input numeric value a is 
calculated using (TT2b . The universe of discourse is the interval [— 2n,2n] radians to 
represent a whole counterclockwise (CCW) turn, or a clockwise (CW) turn. 

a = atan2(e y ,e x ) — (12) 

FIS2 for Target Orientation 

Once the MR has reached the target position in (X,Y), the goal of FIS2 is to face 
the MR in the desired pose. It has one input and one output, the associated lin- 
guistic variable are "theta error" and "angular speed", respectively. In Tableware 
the input and output linguistic variables, their terms name, shapes, and parame- 
ters are also shown. In Table [4] the FIS2 rule base is given. The "theta error" at 
time t is the difference between desired angle and the actual angle, see equation 

e e (t) = e d (t)-e(t) (13) 



3 Experimental Platform 

In this section, the methodology that was carried out to test and validate the locomo- 
tion controller of Fig. [2] and its corresponding VHDL implementation is explained. 
In all the experiments we consider the Spartan 3 FPGA as the target system, so we 
used as the main development software tools: the Xilinx Integrated Software En- 
vironment (Xilinx ISE) to develop the VHDL code of the FPC, the Xilinx System 
Generator (XSG) to migrate the VHDL code to the Simulink/Matlab environment, 
and the Simulink. The idea is to design first the system in Simulink, then in VHDL 
and compare both systems. Next we comment the main steps. 

1. Design the FPC block of Fig. fusing Simulink/Matlab tools. We called this 
design "FPC-smk". It contains the fuzzy systems for position (FIS1), and for 
orientation (FIS2). Fig. [5a] shows how to test FIS1, Fig. [5b] shows how to test 
FIS2; in Figures[6al [6b] and[6c]are the surface control obtained with these two 
last models. 
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Table 2. Fuzzy rule base of FIS 1 





alpha 




NB 


NM 


Z 


MP 


LP 


distance 


S 


V r 


VNM 


VPM 


vz 


VPM 


VNM 


CO,- 


WNB 


WNM 


wz 


WPM 


WPB 


L 


IV 


VNB 


VPB 


VPB 


VPB 


VNB 


CO,- 


WNB 


WNM 


WZ 


WPM 


WPB 



Table 3. Orientation controller (FIS2). It has two inputs and two outputs, whose linguistic 
terms have "W" as the first letter to indicate "angular speed". 



Linguistic variable 


Linguistic terms 


FIS input 


FIS output 


Name 


Type 


Parameters 


theta error 




Negative Big (NB) 


Trapezoidal 


[-n,-n, -1,-0.5] 


Negative Medium (NM) 


Triangular 


[-1,-0.5,0] 


Zero (Z) 


Triangular 


[-0.5,0,0.5,] 


Positive Medium (PM) 


Triangular 


[0,0.5,1] 


Positive Big (PB) 


Trapezoidal 


[0.5,1,7T,7T] 




angular speed, ft),. 


Negative Big (WNB) 


Trapezoidal 


[-3,-3,-1,-0.5] 


Negative Medium (WNM) 


Triangular 


[-1,-0.5,0] 


Zero (WZ) 


Triangular 


[-0.5,0,0.5] 


Positive Medium (WPM) 


Triangular 


[0,0.5,1] 


Positive Big (WPB) 


Trapezoidal 


[0.5,1,3,3] 



Table 4. Fuzzy rule base of FIS2 



theta error 


C0 r 


NB 


WPB 


NM 


WPM 


Z 


WZ 


PM 


WNM 


PB 


WNB 



Design the Locomotion Pose Controller (LPC) of Fig. fusing Simulink/Matlab 
tools, include the "FPC-smk" module of step 1. Fig. [7J shows this design. We 
called this module "LPC-smk". Test the module and save results for future 
comparisons. 

Codify in VHDL the FPC block of Fig.[2j Import the design to Simulink through 
the XSG, this module was called "FPC-vhdl". See Fig.|H]for testing the VHDL 
module. Figures I9all9bl and l9cl show the surface control for this codification. 
In the LPC Model (designed in step 2) of Fig. [71 include the module "FPC- 
vhdl", instead of the "FPC-smk" module. We called this module "LPC-vhdl". 
Test the module, and save results to achieve comparisons against results 
obtained in Step 2. 
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FIS1 
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- - : 
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(a) Model to test FIS1 



(b) Model to test FIS2 



Fig. 5. Simulink models of the FPC block, FIS 1 is for positioning the MR, and FIS2 for ori- 
entation. The outputs of the "FPC-smk" module are linear and angular speeds. These models 
were designed using only native Simulink blocks. 



4 Experiments and Results 

Several comparative experiments were achieved. They were be divided in two 
groups: 

1. Validation of the "FPC-vhdl" module. We compare results of the models "FPC- 
smk" vs. "FPC-vhdl", i.e., Figures [6] and [9] were used to analyze differences in 
the control surfaces, the obtained results are shown in Fig.[lO] We used a lenght 
word of eight bits. Error surfaces are shown in FigfTUl in general the error is 
small, it can be reduced incresing the lenght word. 

2. Validation of the "LPC-vhdl" module. Similarly, we used results obtained using 
models constructed using only native Simulink blocks "LPC-smk" model, to 
contrast results against those obtained using the "LPC-vhdl" model. 



Validating the "LPC-vhdl" Module 

To validate the LPC system coded in VHDL ("LPC-vhdl") several experiments were 
performed. Here, we show one representative experiment, the results are compared 
with the equivalent model that was developed with native blocks of Simulink ("LPC- 
smk"). The initial conditions of the MR are qo = (0.1,0.1,0) , the wheel's radio is 
r = 0.08m, and the axle of driving wheels is 0.45m 

The goal of the experiment is to reach the pose qj — (1.5,0.75, j) . We used 
the model "LPC-vhdl" that includes the FPC coded in VHDL, Fig. Qj] shows with 
a dashed line the system response given by the module in VHDL "LPC-vhdl"; here 
Figures fTTal and fTTbl show the evolution in time to achieve the desired (x,y) position, 
in Fig. Illcl the evolution in time to achieve the final orientation is plotted. Using 
a solid line, in these figures also are shown the results obtained with the model 
"LPC-smk". The position and orientation errors are shown in Fig [12] Differences 
are mainly due to length word, they can be reduced incresing it, however for most 
of practical applications they are not significant. 
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(a) Linear speed (FIS1) 



(b) Angular speed (FIS1) 




(c) Angular speed (FIS2) 
Fig. 6. Control surfaces obtained with the Simulink model of Fig. [3] 
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Fig. 7. Simulink model of the LPC. The structure of this model is the same for testing the 
"FPC-smk" and the "FPC-vhd" modules. Note that in essence, this is the Simulink imple- 
mentation of the block diagram showed in Fig. [2] Therefore, in this figure, the first block 
labeled as "FPC" is the the Fuzzy Pose Controller module, here we can test the "FPC-smk" 
or the "FPC-vhdl" modules. In the second block labeled as "Mobile Robot" are the internal 
control loops illustrated in Fig. [2] The third block labeled as "Kinematic Model" contains the 
transformation matrix S(q) and the integral operation J q to obtain the actual pose q. Below, 
there are two Matlab function blocks, distance and alpha to achieve operations given by jilt 
and d!2b . The other blocks are just viewers to observe results. 
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Fig. 8. Simulink models of FIS 1 and FIS2 coded in VHDL. Both models, according to Fig. [3] 
conform the "FPC-vhdl" model. 
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(c) Angular speed (FIS2) 

Fig. 9. Control surfaces of the "FPC-vhd" module. The two FIS coded in VHDL were im- 
ported to Simulink in order to evaluate them. To obtain the surfaces the modules showed in 
Fig.[S]were used. 
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Fig. 10. Error plots, they were obtained subtracting control surfaces of models "FPC-smk" 
and "FPC-vhdl". The FPC-vhdl module was coded in VHDL using a lenght word of 8 bits. 
The plots show that the error between the surfaces of the module in VHDL against the module 
in Simulink that uses floating point numeric format is small. By increasing the length word 
in VHDL codification the differences between surfaces will be reduced. 
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Fig. 11. VHDL code was imported to the Simulink environment. Although an inference into 
an FPGA can last some nanoseconds, the experimental plant (MR) has a slow time response, 
hence the times showed in the plots. The code into an FPGA is highly parallelized, when it is 
migrated to Simulink, a typical PC cannot reach the same granularity. Also differences in time 
to reach the goal (position x.y), and precision are due to modeling errors and length word. 
The important thing is that the "LPC-vhdl" model showed consistency in all the experiments. 
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(b) Angular speed (FIS1) 




(c) Angular speed (FIS2) 

Fig. 12. Performance comparison of modules "LPC-smk" and "LPC-vhdl". Note that the 
module coded in VHDL introduces an small error in orientation; however, it can be reduced 
increasing the length words to represent numbers. 

5 Conclusions 



This work dealt with the develpment of a VHDL module of a fuzzy pose controller 
(FPC-vhdl), that can be a susbsystem of a locomotion controller embedded into an 
FPGA. Based on the FPC, the complete locomotion controller for pose regulation 
(LPC) was developed, it has the characteristic that does not need any complicated 
dynamic model to work; hence providing the model with the adequate speed mo- 
tor controller it can work for several applications. This feature makes the system 
viable for the development of an application specific integrated circuit (ASIC), that 
can be used in diverse application; for example in automotive industry for auto- 
matic steering for self-parking for cars with differential tracking. In general, the 
differences (errors) between the proposal coded in VHDL and the results obtained 
with Simulink were small, and expected since we used a length word of eight-bits, 
whereas Simulink uses floating point. 
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